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Improvements In or Relating to Protein Production 
Background 

The " present invention relates to a method for producing 
5 heterologous glycosylated- proteins in bryophyte cells, such as 
in transformed Physcomitrella patens cells in culture. In 
particular, the method relates to a method for producing 
glycosylated proteins comprising animal glycosylation patterns, 
such as pharmaceutical proteins for use in mammals, including 
10 humans , in bryophyte cells such as Physcomitrella patens cells, 
the genetic material required therefore, such as DNA and RNA, 
vectors, host cells, methods of introduction of genetic material 
there into, and uses thereof. 

15 In the past, heterologous proteins have been produced using a 
variety of transformed cell systems, such as those derived from 
bacteria, fungi, such as yeasts, insect, plant or mammalian cell 
lines (Kudo T. 1994, In: Y . Murooka and T. Amanaka (Eds.) 
Recombinant microbes for industrial and agricultural 

20 applications, pp. 291-299, Marcel Dekker, New York; Harashima - t 
S., Bioproc. Technol. 1994, 19: 137-158; Archer D.B. 1994, In: 
Y. Murooka and T. Amanaka (Eds.) Recombinant microbes for 
industrial and agricultural applications, pp. 373-393, Marcel 
Dekker, New York; Goosen M.F.A. 1993, In: M.F.A. Goosen, A. 

25 Baugulis and P. Faulkner (Eds.) Insect cell culture engineering, 
pp. 1-16, Marcel Dekker, New York; Hesse F. & Wagner R. , Trends 
in Biotechnol. 2000, 18(4): 173-180). 

Proteins produced in prokaryotic organisms may not be post- 
30 translationally modified in a similar manner to that of 
eukaryotic proteins produced in eukaryotic systems, e.g. they 
may not be glycosylated with appropriate sugars at particular 
amino acid residues, such as aspartic acid (N) residues (N- 
linked glycosylation). Furthermore, folding of bacterially- 
35 produced eukaryotic proteins may be inappropriate due to, for 
example, the inability of the bacterium to form cysteine 
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• disulfide bridges. Moreover, bacterially-produced recombinant 
proteins frequently aggregate and accumulate as insoluble 
inclusion bodies. 

5 Eukaryotic cell systems are better suited for the production of 
glycosylated proteins found in various eukaryotic organisms, 
such as humans, since- such cell systems may effect post- 
translational modifications, such as glycosylation of produced 
proteins. However, a problem encountered in eukaryotic cell 

10 systems which have been transformed with heterologous genes 
suitable for the production of protein sequences destined for 
use, for example, as pharmaceuticals, is that the glycosylation 
pattern on such proteins often acquires a native pattern, that 
is, of the eukaryotic cell system in which the protein has been 

15 produced: glycosylated proteins are produced that comprise non- 
animal glycosylation patterns and these in turn may be 
immunogenic and/or allergenic if applied In animals, including 
humans . 

20 The use of recombinant glycoproteins produced by higher plants 
is limited by the plant-specific N-glycosylation that is 
acquired on such proteins. Compared to mammalian-derived 
glycoproteins, higher plant-specific glycoproteins contain two 
additional residues. Moreover, in higher plant glycoproteins 

25 terminal beta 1, 4-galactose residues are not found, indicating 
that a beta 1, 4-galactosyltransf erase is not present in plants. 
Stable integration and expression of this enzyme in tobacco 
plants (Bakker et al . (2001) Proc Natl Acad Sci USA, 98 ,2899- 
2904) as well as in tobacco BY2 cells (Palacpac et al . (1999) 

30 Proc Natl Acad Sci USA 96, 4692-4697) has been described. The 
recombinant human beta 1, 4-galactosyltransferase was functional 
and proteins isolated from transgenic material exhibited 
terminal beta 1, 4-galactose residues. Nevertheless, in higher 

x * plants it is not thought possible to suppress the activities of 

35 beta 1, 2-xylosyltransf erase and alpha 1, 3-fucosyltransf erase, 
the two enzymes that are considered responsible for transferring 
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the additional, plant-specific residues. These residues are 
considered to be allergenic for humans (Garcia-Cassado et al. 
(1996) Glycobiology 6, 471-477; van Ree et al . 2000, J. Biol. 
. Chem. 275, No . 15, 11451-11458 ) . All data on plant-specific N- 
5 glycosylation has been generated in studies with higher plants. 

The bryophyte, Physcomitrella patens, a haploid non-vascular 
land plant, can also be used for the production of recombinant 
proteins (WO 01/25456) . 

10 

The life cycle of mosses is dominated by photoautotrophic 
gametophytic generation. The life cycle is. completely different 
to that of the higher plants wherein the sporophyte is the 
dominant generation and there are notably many differences to be 
15 observed between higher plants and mosses. 

The gametophyte of mosses is characterised by two distinct 
developmental stages. The protonema which develpps via apical 
growth, grows into a filamentous network of only two cell types 

20 (chloronemal and caulonemal cells) . The second stage, called the , 
gametophore, differentiates by caulinary growth from a simple 
apical system. Both stages are photoautotrophically active.- 
Cultivation of protonema. without differentiation into the more 
complex gametophore has been shown for suspension cultures in 

25 flasks as well as for bioreactor cultures (WO 01/25456). 
Cultivation of fully differentiated and photoautrophically 
active multicellullar tissue containing- only a few cell types is 
not described for higher plants. The genetic stability of the 
moss cell system provides an important advantage over plant cell 

30 cultures and the stability of photoautotrophically active bryo- 
phyte cultures has been confirmed (Rieck 1996, Struktur- 
aufkiarung und stereochemische Untersuchungen von Sesquiterpenen 
als Inhaltsstoffe atherischer Ole. Ph.D. thesis, Hamburg Uni- 
versity) . In cell cultures of higher plants the secondary meta- 

35 holism is- more differentiated and this results in differences in 
secondary metabolite profiles. 
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In addition, there are some important differences between mosses 
and higher plants on the biochemical level. Sulfate assimilation 
in. Physcomltrella patens differs significantly from that in 
higher plants. The key enzyme of sulfate assimilation in higher 
5 plants is adenosine 5 ' -phosphosulf ate reductase. In Physcomi- 
trella patens an alternative pathway via phosphoadenosine 5 1 - 
phosphosulf ate reductase co-exists (Koprivova et al. (2002) J.. 
Biol. Chem. 277,32195-32201). .This pathway has not been 
characterised in higher plants. 

10 

Furthermore, many members of the moss, algae and fern families 
produce a wide range of polyunsaturated fatty acids (Dembitsky 
(1993) Prog. Lipid Res. 32, 281-356) . For example, arachidonic 
.acid and eicosapentaenoic acid are thought to be produced only 

15 by lower plants and not by higher plants. Some enzymes of the 
metabolism of polyunsaturated fatty acids, (delta 6-acyl-group 
desaturase) (Girke et al. (1998), Plant J, 15, 39-48) and a 
component of a delta 6 elongase (Zank et al. (2002) Plant J 31, 
255-268), have been cloned, from Physcomitrella patens. No 

20 corresponding genes have been found in higher plants. This fact 
appears to confirm that essential differences exist between 
higher plants and lower plants at the biochemical level. 

• Further differences are reflected in the regeneration of the 
25 cell wall. Protoplasts derived from higher plants regenerate new 
cell walls in a rapid manner, independently of the culture 
medium. Direct transfer of DNA via polyethylene glycol (PEG) 
into protoplasts of higher plants requires pre-incubation at 4 
to 10 °C .to slow down the process of cell wall regeneration (US 
30 Patent 5508184) . In contrast, cell wall regeneration of proto- 
plasts derived from protonema of Physcomitrella is dependent on 
culture medium. Protoplasts can be cultivated without regene- 
ration of the cell wall over long periods . Without the intention 
of being bound by theory, it appears that the secretion 
35 machinery of the moss protoplast, essential for cell wall 
regeneration and protein glycosylation, differs from that of 
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higher plants. Moreover, Physcomitrella patens shows highly 
efficient homologous recombination in its nuclear DNA, a unique 
feature for plants, which enables directed gene disruption 
(Girke et al. (1998) Plant J, 15, 39-48; Strepp et al. (1998) 
5 Proc Natl Acad Sci USA 95,4368-4373; Koprivova (2002) J. Biol. 
Chem.277, 32195-32201; reviewed by Reski (1999) Planta 208, 301- 
309; Schaefer and Zryd (2001) Plant Phys 127, 1430-1438; 
Schaefer (2002) Anna. Rev. Plant BioJ.53, 477-501) further 
illustrating fundamental differences to higher plants. However, 

10 the use of this mechanism for altering glycosylation patterns 
has proven to be problematic, as shown herein in the examples. 
Disruption of N-acetylglucosaminyltransf erase I (GNTl) in 
Physcomitrella patens resulted in the loss of the specific 
transcript but only in minor differences of the N-glycosylation 

15 pattern. These results were in direct contrast to the loss of 
Golgi -modified complex glycans in a mutant Arabidopsis thallana 
plant lacking GNTl observed by von Schaewen et al. (1993) Plant 
Physiol 102, 1109-1118). Thus, the knockout in Physcomitrella 
patens did not result in the expected modification of the N- 

20 glycosylation pattern. 

Although the knockout strategy was not successful for GNTl, the 
present inventors attempted to knock out the beta 1,2-xylosyl- 
transferase (XylT) and alpha 1, 3-fucosyltransf erase (FucT) in 

25 Physcomitrella patens. Specific transcripts could not be 
detected in the resulting plants. Surprisingly, the N-linked 
. glycans isolated from the transgenic plants were found to be 
modified in the' desired manner. No 1,3 linked fucosyl residues 
could be detected on N-linked glycans of FucT knockout plants 

30 and no 1,2 linked xylosyl residues could be detected on N-linked 
glycans of XylT knockout plants. The isolated transgenic lines 
showed normal growth which is surprising considering that plant 
specific glycosylation is highly conserved and therefore would 
be expected to be significant for function. Double knockouts 

35 should therefore have been expected to have had a detrimental 
effect on the growth of the moss*. In addition, compensating were 
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expected but surprisingly were not apparent. Moreover, the 
double knockout of FucT and XylT resulted in modified N-linked 
gl yeans without detectable 1,3 linked fucosyl and 1,2 linked 
xylosyl residues. 

Integration of the human beta 1, 4-galactosyltransf erase into the 
genome of a double knockout Physcomltrella patens plant resulted 
in a mammalian-like N-linked glycosylation pattern without the 
plant specific fucosyl and xylosyl residues and with mammalian- 
like terminal 1,4 galactosyl residues. The galactosyltransf erase 
was found to be active. Such a modification of N-linked glycans 
without loss of viability was not expected because N-glyco- 
sylation is very complex and well regulated. It is not only 
dependent on developmental stages (for plants: Elbers et al . 
(2001) Plant Phys 126, 1314-1322) but also dependent on culture 
conditions (for mammalian cell culture: Hills et al. (2001). 
Biotechnol. Bioeng. 75, 239-251). 

It is an object of the present invention to provide a more, 
efficient method of producing animal proteins comprising animal 
glycosylation patterns, and in particular, glycosylated human 
proteins comprising human glycosylation patterns thereon. It is 
a further object to provide an efficient process for the 
production of heterologous animal proteins comprising animal 
glycosylation patterns, particularly human proteins comprising 
human glycosylation patterns in bryophytes, such as 
Physcomltrella patens. 

These and other objects will become apparent from the following 
description and examples provided herein. 

Detailed Description 

According to the present invention there is provided a 
transformed bryophyte cell that comprises i) a dysfunctional 
fucosyl transferase nucleotide sequence and ii) a dysfunctional 
xylosyl transferase nucleotide sequence. 
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The bryophyte cell of the invention is one from a moss selected 
from the group mosses including liverworts., of species from the 
genera Physcomitrella r Funaria, Sphagnum, Ceratodon, Marchantia 
and Sphaerocarpos . The bryophyte cell is preferably from 
5 Physcomitrella patens. 

The bryophyte cell, such as a Physcomitrella patens cell, can be 
any cell suitable for transformation according to methods of the 
invention as described herein, and may be a moss protoplast 
10 cell, a cell found in protonema tissue or other cell type. 
Indeed, the skilled addressee will appreciate that moss plant 
tissue comprising populations of transformed bryophyte cells 
according to the invention, such as transformed protonemal 
tissue also forms an aspect of the present invention. 

15 

"Dysfunctional" as used herein means that the nominated trans- 
fer rase nucleotide sequences of fucosyl transferase (fucT) and 
xylosyl transferase (xylT) are substantially incapable of 
encoding niRNA that codes for functional fucT and xylT proteins 

20 that are capable of modifying plant N-linked glycans with plant- 
like glycoslation patterns comprising 1,3 linked fucosyl and 1,-2 
"linked xylosyl residues. In a preferment, the dysfunctional fucT 
and xylT plant transferase nucleotide sequences comprise 
targeted insertions of exogenous nucleotide sequences into 

25 endogenous, that is genomic, native fucT and xylT genes 
comprised in the nuclear bryophyte genome (whether it is a truly 
native bryophyte genome, that is in bryophyte cells that have 
not been transformed previously by man with other nucleic acid 
sequences, or in a transformed nuclear bryophyte genome in which 

30 nucleic acid sequence insertions have been made previously of 
desired nucleic acid sequences) which substantially inhibits or 
represses the transcription of mRNA coding for functional fucT 
and xylT transferase activity. 

35 In a further preferment, the dysfunctional fucT and xylT plant 
transferase nucleotide sequences may comprise targeted deletions 
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of the whole or substantially the whole endogenous gene 
sequences thereof, or indeed, targeted partial deletions of the 
endogenous gene sequences of fucT and - xylT nucleotide sequences, 
thus rendering such sequences dysfunctional within the context 
5 of the invention. 

Bryophyte cells of the invention or ancestors thereof may be any 
which have been transformed previously with heterologous genes 
of interest that code for primary sequences of proteins of 

10 interest which are glycosylated with mammalian glycosylation 
patterns as described herein. Preferably, the. glycosylation 
patterns are of the human type. Alternatively, the bryophyte 
cell may be transformed severally, that is, simultaneously or 
over time with nucleotide sequences coding for at least a 

15 primary protein sequence of interest, typically at least a 
pharmaceutical protein of interest for use in humans or mammals 
such as livestock species including bovine, ovine, equine and 
porcine species, that require mammalian glycosylation patterns 
to be placed on them in accordance with the methods of the 

20 invention as described herein. Such pharmaceutical glycoproteins 
for use in mammals, including man include but are not limited to 
proteins- such as insulin, preproinsulin, VEGF, proinsulin, 
glucagon, interferons such as alpha-interf eron, beta-interf eron, 
gamma-interferon, blood-clotting factors selected from Factor 

25 VII, VIII, IX, X, XI, and XII, fertility hormones including 
luteinising hormone, follicle stimulating hormone growth factors 
including epidermal growth factor, platelet-derived growth 
factor, granulocyte colony stimulating factor and the like, 
prolactin, oxytocin, thyroid stimulating hormone, adrenocortico- 

30 tropic hormone, calcitonin, parathyroid hormone, somatostatin, 
erythropoietin (EPO) , enzymes such as beta-glucocerebrosidase, 
haemoglobin, serum albumin, collagen, fusion proteins such as the 
fusion protein of TNF alpha receptor ligand binding domain with 
Fc portion of IgG and the like. Furthermore, the method of the 

35 invention can be used for the production of antibodies such as 
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specific monoclonal antibodies or active fragments thereof. 

In a preferment, there is provided a transformed bryophyte cell 
that comprises i) a dysfunctional fucosyl transferase nucleotide 
5 sequence and ii) a dysfunctional xylosyl transferase nucleotide 
sequence and iii) a nucleotide sequence operably linked to an 
exogenous promoter that drives expression in the said bryophyte 
cell wherein said nucleotide sequence encodes a functional 
mammalian galactosyl transferase that is expressed in the 
10 bryophyte cell. 

In a further preferment, the mammalian galactosyl transferase, 
nucleotide sequence is a beta-1,4 galactosyl transferase (beta- 
1,4 galT) and most preferably is a human beta-1,4 galactosyl 

15 . transferase nucleotide sequence. The skilled addressee will 
appreciate that the "beta-1,4 galT nucleotide sequence may be a 
cDNA sequence or a genomic DNA sequence and may comprise a 
degeneratively equivalent nucleotide beta-1,4 galT sequence as 
long as the N-glycan glycosylation pattern on any desired 

20 glycosylated exogenous protein produced in the transformed 
bryophyte cells or bryophyte tissue- of the invention is 
substantially mammalian in pattern, if not completely mammalian 
in pattern, and most - preferably, where appropriate, is human in 
pattern . 

25 

Detailed information on the. culturing of mosses which are 
suitable for use in the invention, such as Leptobryum pyri forme 
and Sphagnum magellanicum in bioreactors, is known in the prior 
art (see, for example, E. Wilbert, "Biotechnological studies 

30 concerning the mass culture of mosses with particular 
consideration of the arachidonic acid metabolism", Ph.D. thesis, 
University of Mainz (1991); H. Rudolph and S. Rasmussen, Studies 
on secondary metabolism of Sphagnum cultivated in bioreactors, 
Crypt. Bot., 3, pp. 67-73 (1992)). Especially preferred for the 

35 purposes of the present invention is the use of Physcomitrella 
patens, since molecular biology techniques are practised on this 
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organism (for a review see R. Reski, Development, genetics and 
molecular biology of mosses, Bot. Acta, 111", pp. 1-15 (1998)). 

Suitable transformation systems have been developed for the 
5 biotechnological exploitation of Physcomitrella for the 
production of heterologous proteins. For example, successful 
transformations have been carried out by direct DNA transfer 
into protonema tissue using particle guns. PEG-mediated DNA' 
transfer into moss protoplasts has also been successfully 

10 achieved. The PEG-mediated transformation method has been 
described many times for Physcomitrella patens and leads both to 
transient and to stable transf ormants (see, for example, K. 
Reutter and R. Reski, Production of a heterologous protein in 
bioreactor cultures of fully differentiated moss plants, PI. 

15 Tissue culture and Biotech., 2, pp. 142-147 (1996)). 

In a further embodiment of the . present invention there is 
provided a method of producing at least a bryophyte cell . wherein 
fucT and xylT activity is substantially reduced that, comprises 
20 introducing into the said cell i) a first nucleic acid sequence 
that is specifically targeted to an endogenous fucosyl 
transferase nucleotide sequence and ii) a second nucleic acid 
sequence that is specifically targeted to an endogenous xylosyl 
transferase nucleotide sequence. 

25 

The skilled addressee will appreciate that the order of intro- 
duction of said first and second transferase nucleic acid 
sequences into the bryophyte cell is not important: it can be 
performed in any order. The first and second nucleic acid 

30 sequences can be targeted to specific portions of the 
endogenous, native fucT and xylT genes located in the nuclear 
genome of the bryophyte cell defined by specific restriction 
enzyme sites thereof, for example, according to the examples as 
provided herein. By specifically targeting the sequences of the 

35 native fucT and xylT transferase genes with nucleotide sequences 
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that specifically integrate with the target native transferase 
genes of interest, the expression of the said sequences is 
substantially impaired if not completely disrupted. 

5 In a preferred embodiment of the present invention there is 

provided a method of producing at least a heterologous or 

exogenous glycosylated mammalian protein in a transformed 
bryophyte cell that comprises: 

10 i) introducing into said cell a first isolated nucleic acid 
that comprises a first nucleic acid sequence that is 
specifically targeted to an endogenous fucosyl transferase 
nucleotide sequence; and 

15 ii) introducing into said cell a second nucleic acid sequence 
that is specifically targeted to an endogenous xylosyl 
transferase nucleotide sequence; and 

iii) introducing into said cell a third isolated nucleic acid 
20 sequence that comprises, nucleic acid operably linked to an 

exogenous promoter that drives expression in a bryophyte 
cell wherein - said nucleic acid encodes at least one 
mammalian galactosyl transferase polypeptide. 

25. In this embodiment of the invention the at least one mammalian 
galactosyl transferase polypeptide is preferably .a beta-1,4 
galactosyl transferase (beta-1,4 galT) and most preferably is a 
human beta-1,4 galactosyl transferase nucleotide sequence. 

30 Preferably all. glycosylated mammalian proteins mentioned herein- 
above are of the human type . Other proteins that are contem- 
plated for production in the present invention include proteins 
for use in veterinary care and may correspond to animal 
homologues of the human proteins mentioned herein. 



35 



An exogenous promoter is one that denotes a promoter that is 
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introduced in front of a nucleic acid sequence of interest and 
is operably associated therewith. Thus an- exogenous promoter is 
one that has been placed in front of a selected nucleic acid 
component as herein defined and does not consist of the natural 
5 or native promoter usually associated with the nucleic acid 
component of interest as found in wild type circumstances. Thus 
a promoter may be native to a bryophyte cell of interest but may 
not be operably associated with the nucleic acid of interest in 
front in wild- type bryophyte cells. Typically, an exogenous 
10 promoter is one that is transferred to • a host bryophyte cell 
from a source other than the host cell. 

The cDNA's encoding the galT proteins, the glycosylated and the 
mammalian proteins as described herein contain at least one type 

15. of promoter that is operable in a bryophyte cell, for example, 
an inducible or a constitutive promoter operatively linked to a 
galT nucleic acid sequence and/or second nucleic acid sequence 
for a glycosylated mammalian protein as herein defined and as 
provided by the present invention. As discussed, this enables 

.20 control of expression of the gene(s). 

The term "inducible" as applied to a promoter is well understood 
by those skilled in the art- In essence, expression under the 
control of an inducible promoter is "switched on" or increased 

25 in response to an applied stimulus (which may be generated 
within a cell or provided exogenously) . The nature of the 
stimulus varies between promoters. Some inducible promoters 
cause little or undetectable levels of expression (or no 
expression) in the absence of the appropriate stimulus. Other 

30 inducible promoters cause detectable constitutive expression in 
the absence of the stimulus. Whatever the level of expression is 
in the absence of the stimulus, expression from any inducible 
promoter is increased in the presence of the correct stimulus. 
The preferable situation is where the level of expression 

35 increases upon application of the relevant stimulus by an amount 
effective to alter a phenotypic characteristic. Thus an 
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inducible (or " s wit enable " ) promoter may be used which causes a 
basic level of expression in the absence of the stimulus which 
level is too low to bring about a desired phenotype (and may in 
fact be zero) Upon application of the stimulus, expression is 
5 increased (or switched on) to a level, which brings about the 
desired phenotype. 

As alluded to herein, bryophyte expression systems are also 
known to the man skilled in the art. A bryophyte promoter, in 

10 particular a Physcomitrella patens promoter, is any DNA sequence 
capable of binding a host DNA-dependent RNA polymerase and 
initiating the downstream (3 1 ) transcription of a coding 
sequence (e.g. structural gene)- into mRNA. A promoter will have 
a transcription initiation region which is usually placed 

15 proximal to the 5' end of the coding sequence. This 
transcription initiation region usually includes an RNA 
polymerase binding site . (the. "TATA Box") and. a transcription 
initiation site. A bryophyte promoter may also have a second 
domain called an upstream activator sequence (OAS) , which, if 

20 present, is usually distal to the structural gene. The UAS 
permits regulated (inducible) expression. Constitutive 
expression occurs in the absence of a UAS. Regulated expression 
may be either positive or negative, thereby either enhancing or 
reducing transcription.. 

The skilled addressee will appreciate that bryophyte promoter 
sequences encoding enzymes in bryophyte metabolic pathways can 
provide particularly useful promoter sequences. 

30 In addition, synthetic promoters which do not occur in nature 
may also function as bryophyte promoters. For example, UAS 
sequences of one byrophyte promoter may be joined with the 
transcription activation region of another bryophyte promoter, 
creating a synthetic hybrid promoter. An example of a suitable 

35 promoter is the one used in the TOP 10 expression system for 
Physcomitrella patens by Zeidler et al. (1996) Plant. Mol. Biol. 
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30, 199-205). Furthermore, a bryophyte promoter can include 
naturally occurring promoters of non-bryophyte origin that have 
the ability to bind a bryophyte DNA-dependent RNA polymerase and 
initiate transcription. Examples of such promoters include those 
5 .described, inter alia, the rice P-Actin 1 promoter and the 
Chlamydomonas RbcS promoter (Zeidler et al . (1999) J. Plant 
Physiol. 154, 641-650), Cohen et al., Proc. Natl. Acad. Sci. 
USA, 77 : 1078,. 1980; Henikoff et al . , Nature, 283: 835, 1981; 
Hollenberg et al., Curr. Topics Microbiol. Immunol., 96: 119, 

10 198 1; Hollenberg et al., "The Expression of Bacterial Anti- 
biotic Resistance Genes in the Yeast Saccharomyces cerevisiae" , 
in: Plasmids of Medical, Environmental and Commercial Importance 
(eds. K. N. Timms and A. Puhler) , 1979; Mercerau-Puigalon et 
al., Gene, 1 1: 163, 1980; Panthier et al . , Curr. Genet., 2: 

15 109, 1980. 

A DNA molecule may be expressed intracellularly in bryophytes. A 
promoter sequence may be directly linked with the DNA molecule, 
in which case the first amino acid at the N-terminus of the 
20 recombinant protein will always be a methionine, which is 
encoded by the AUG start codon on the mRNA. If desired, 
methionine at the N-terminus may be cleaved from the' protein by 
in vitro incubation with cyanogen bromide . 

25 Alternatively, foreign proteins can also be secreted from the 
bryophyte cell into the growth media by creating chimeric DNA 
molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion in or out of 
bryophyte cells of the foreign protein. Preferably, there are 

30 processing sites encoded between the leader fragment and the 
foreign gene that can be cleaved either in vivo or in vitro. The 
leader sequence fragment usually encodes a signal peptide 
comprised of hydrophobic amino acids which direct the secretion 
of the protein from the cell. 

35 

DNA encoding suitable signal sequences can be derived from genes 
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for secreted bryophyte proteins , such as leaders of non- 

bryophyte origin, such as a VEGF leader, exist that may also 
provide for secretion in bryophyte cells. 

5 Transcription termination sequences that are recognized by and 
functional in bryophyte cells are regulatory regions located 3 f 
to the translation stop codon, and thus together with the 
promoter flank the coding sequence. These sequences direct the 
transcription of an mRNA which can be translated ' into the 
10 polypeptide encoded by the DNA. An example of a suitable 
termination sequence that works in Physcomitrella patens is the 
termination region of Cauliflower mosaic virus. 

Typically, the components, comprising a promoter, leader (if 

15 desired) , coding sequence of interest, and transcription 
termination sequence, are put together into expression, 
constructs of the invention. Expression constructs are often 
maintained in a DNA plasmid, which is an extrachromosomal 
element capable of stable maintenance in a host, such as a 

20 bacterium. The DNA plasmid may have two origins of replication, 
thus allowing it to be maintained, for example, in a bryophyte 
for expression and in a prokaryotic host for cloning and 
amplification. Generally speaking it is sufficient if the 
plasmid has one origin of replication for cloning and 

25 amplification in a prokaryotic host cell. In addition,, a DNA 
plasmid may be either a high or low copy number plasmid. A high 
copy number plasmid will generally have a copy number ranging 
from about 5 to about 200, and usually about 10 to about 150. A 
host containing a high copy number plasmid will preferably have 

30 at least about 10, and more preferably at least about 20. Either 
a high or low copy number vector may be selected, depending upon 
the effect of the vector and the foreign protein on the host 
(see, e.g., Brake et al., supra). 

35 Alternatively, the expression constructs can be integrated into 
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the bryophyte genome with an integrating vector. Integrating 
vectors usually contain at least one sequence homologous to a 
bryophyte chromosome that allows the vector to integrate, and 
preferably contain two homologous sequences flanking the 
5 expression construct. An integrating vector may be directed to a 
specific locus in moss by selecting the appropriate homologous 
sequence for \ inclusion in the vector as described and 
exemplified herein. One or more expression constructs may 
integrate. The chromosomal sequences included in the vector can 
10 occur either as a single segment in the vector, which results in 
the integration of the entire vector, or two segments homologous 
to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the 
stable integration of only the expression construct. 

15 

Usually, extrachromosomal and integrating expression constructs 
may contain selectable markers to allow for the selection of 
bryophyte cells that have been transformed. 

20 Selectable markers may include biosynthetic genes that can be 
expressed in the moss host, such as the G418 or hygromycin B 
resistance genes, which confer resistance in bryophyte cells to 
G418 and hygromycin B, respectively. In addition, a suitable 
selectable marker may also provide bryophyte cells with the 

25 ability to grow in the presence of toxic compounds, such as 
metal. 

Alternatively, some of the above-described components can be put 
together into transformation vectors. Transformation vectors are 
30 usually comprised of a selectable marker that is either 
maintained in a DNA plasmid or developed into an integrating 
vector, as described above. 

Methods of introducing exogenous DNA into bryophyte cells are 
3.5 well-known in the art, and are described inter alia by Schaefer 
D. G. "Principles and protocols for . the moss Physcomitrella 
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patens", (May 2001) Institute of. Ecology, Laboratory of Plant 

Cell Genetics, University of Lausanne Didier . Schaef er@ie- 

pc.unil .ch ; Reutter K. and Reski R. , Plant Tissue Culture and 

Biotechnology September 1996, Vol.2, No. 3; Zeidler M et al., 
5 (1996), Plant Molecular Biology 30:199-205. 

Those skilled in the art are well able to construct vectors and 
design protocols for recombinant nucleic acid sequence or gene 
expression as described above. Suitable vectors can be chosen or 

10 constructed, containing appropriate regulatory sequences, 
including promoter sequences, terminator fragments, poly- 
adenylation sequences, enhancer sequences, marker genes and 
other sequences as appropriate. For further details see, for 
example, Molecular Cloning: a Laboratory Manual: 2nd edition, 

15 Sambrook et al, 1989, Cold Spring Harbor Laboratory Press. Many 
known techniques and protocols for manipulation of nucleic acid,' 
for example in preparation of nucleic acid constructs, 
mutagenesis, sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in- detail in 

20 Current Protocols in Molecular Biology, Second Edition, Ausubel 
ef al. eds., John Wiley & Sons, 1992. The disclosures of 
Sambrook et al. and Ausubel et al . are incorporated herein by- 
reference. 

25 Naturally, the skilled addressee will appreciate that each 
nucleic acid sequence coding for the appropriate galT and 
polypeptide to be glycosylated will be under regulatory control 
of its own exogenous promoter and terminator. When two or more 
target proteins are destined to be produced from a single 

30 carrier RNA it is preferable if they are able to be readily 
separated, for example by binding to different protein-specific 
antibodies (monoclonal or polyclonal) in the harvesting phase of 
the bryophyte cell culture system. 

35 As described above, selectable genetic markers may facilitate 
the selection of transgenic bryophyte cells and these* may 
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consist of chimaeric genes that confer selectable phenotypes as 
alluded to herein. 

When introducing selected glycosyl transferase nucleic acid 
5 sequences and polypetide sequences for glycosylation into a 
bryophyte cell, certain considerations must be taken into 
account, well known to those skilled in the art. The nucleic 
acid(s) to be inserted should be assembled within a construct, 
which contains effective regulatory elements, which will drive 
10 transcription. There must be available a method of transporting 
the construct into the cell. Once the construct is within the 
cell membrane, integration into the endogenous chromosomal 
material either will or will not occur. 

15 The invention further encompasses a host cell transformed with 
vectors or constructs as set forth above, especially. a bryophyte 
or a microbial cell. Thus, a host cell, such as a bryophyte 
cell, including nucleotide sequences of the invention as herein 
indicated is provided. Within the cell, the nucleotide sequence 

20 may be incorporated within the chromosome. 

Also according to the invention there is provided a. bryophyte 
cell having incorporated into its genome at least a nucleotide 
sequence, particularly heterologous nucleotide sequences, as 

25 provided by the present invention under operative control of 
regulatory sequences for control of expression as herein 
described. The coding sequence may be operably linked to one or 
more regulatory sequences which may be heterologous or foreign 
to the nucleic acid sequences employed in the invention, such as 

30 not naturally .associated with the nucleic acid sequence (s) for 
its (their) expression. The nucleotide sequence according to the 
invention may be placed under the control of an externally 
inducible promoter to place expression under the control of the 
user. A further aspect of the present invention provides a 

.35 method of making such a bryophyte cell, particularly • a Physco- 
. mitrella patens cell involving introduction of nucleic acid 
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sequence (s) contemplated for use in the invention or at least a 
suitable vector including the sequence (s) contemplated for use 
in the invention into a bryophyte cell and causing or allowing 
recombination between the vector and the bryophyte cell genome 
5 to introduce the said sequences into the genome. The invention 
extends to bryophyte cells,, particularly Physcomitrella patens 
cells containing a GalT nucleotide and/or a nucleotide sequence 
. coding for a polypeptide sequence destined for the addition of a 
mammalian glycosylation pattern thereto and suitable for use in 
10 the invention as a result of introduction of the nucleotide 
sequence into an ancestor cell . 

The term "heterologous" may be used to indicate that the 
gene/sequence of nucleotides in question have been introduced 

15 into bryophyte cells or an ancestor . thereof, using genetic 
engineering, ie by human intervention. A transgenic bryophyte 
cell, i.e. transgenic for the nucleotide sequence in question, 
may be provided. The transgene may be on an extra-genomic vector 
or incorporated, preferably stably, into the genome. A 

20 heterologous gene may replace an endogenous equivalent gene, ie 
one that normally performs • the same or a similar function, or 
the inserted sequence may be additional to the endogenous gene 
or other sequence. An advantage of introduction of a hetero- 
logous gene is the ability to place expression of a sequence 

25 under the control of a promoter of choice, in order to be able 
to influence expression according to preference. Nucleotide 
sequences heterologous, or exogenous or foreign, to a bryophyte 
cell may be non-naturally occurring in cells of that type, 
strain or species. Thus, a nucleotide sequence may include a 

30 .coding sequence of or derived from a particular type of 
bryophyte cell, such as a Physcomitella patens cell, placed 
within the context of a bryophyte cell of a different type or 
species. A further possibility is for a nucleotide sequence to 
be placed within a bryophyte cell in which it or a homologue is 

35 found naturally, but wherein the nucleotide sequence is linked 
and/or adjacent to nucleic acid which does not occur naturally 
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within the cell, or cells of that type or species or strain, 
such as operably linked to one or more regulatory sequences, 
such as a promoter sequence, for control of expression. A 
sequence within . a bryophyte or other host cell may be 
5 identifiably heterologous, exogenous or foreign. 

The present invention also encompasses the desired polypeptide 
expression product of the combination of nucleic acid molecules 
according to the invention as disclosed herein or obtainable in 

10 accordance with the information and suggestions herein. Also 
provided are methods of making such an expression product by 
expression from nucleotide sequences encoding therefore under 
suitable conditions in suitable .host cells e.g. E.coll. Those 
skilled in the art are well able to construct vectors and design 

15 protocols and systems for expression and recovery of products of 
recombinant gene expression. 

A polypeptide according to the present invention may be an 
allele, variant, fragment, derivative, mutant or homologue of 

20 the (a) polypeptides as mentioned herein. The allele, variant, 
fragment, derivative, mutant or homologue may have substantially 
the same function of the polypeptides alluded to above and as 
shown herein or may be a functional mutant thereof. In the 
context of pharmaceutical proteins as described herein for use 

25 in humans, the skilled addressee will appreciate that the 
primary sequence of such proteins and their glycosylation 
pattern will mimick or preferably be identical to that found in 
humans . 

30 "Homology" in relation to an amino acid sequence of the 
invention may be used to refer to identity or similarity, 
preferably identity. As noted already above, high level of amino 
acid identity may be limited to functionally significant domains 
or regions, e.g. any of the domains identified herein. 



35 
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In particular, homologues of the particular bryophyte-derived. 
polypeptide sequences provided herein, are provided by the . 
present invention, as are mutants, variants, fragments and 
derivatives of such homologues. Such homologues are readily 
5 obtainable, by use of the disclosures made herein. Naturally, the 
skilled addressee will appreciate that homologues of the 
glycosylated protein sequences per " se, other than those 
homologues that due to the degeneracy . of the * genetic code give 
rise to amino acid sequences that are true Copies (i.e. 100% 

10 identical) of the mammalian proteins of interest, and especially 
of human proteins of interest, are encompassed within the 
present invention. Thus the present invention also extends to 
polypeptides which include amino acid sequences with GalT 
function as defined herein and as obtainable using sequence 

15 information as provided herein. The GalT homologues may at the 
amino acid level have homology, that is identity, with the amino 
acid sequences described in the prior art as described herein 
i.e. under the database accession numbers provided in the 
examples section, preferably at least about 50%, or at least 

20 55%, or at least about 60%, or at least about 65%, or at least 
about 70%, or at least about 75%, or at least about 80% 
homology, or at least about 85 %, or at least about 88% 
homology, or at least about 90% homology and most preferably at 
least about 95% or greater homology provided that such proteins 

25 have a GalT- activity that fits within the context of the present 
invention. 

In certain embodiments, an allele, variant, derivative, mutant 
derivative, mutant or homologue of the specific sequence may 

30 show little overall homology, say about 20%, or about 25%, or 
about 30%, or about 35%, or about 40% or about 45%, with the 
specific sequence. However, in functionally significant domains 
or regions, the amino acid homology may be much higher. Putative 
functionally significant domains or regions can be identified 

35 using processes of bioinf ormatics, including comparison of the 
sequences of homologues. 
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Functionally significant domains or regions of different 
polypeptides may be combined for expression from encoding 
nucleic acid as a fusion protein. For example, particularly 
advantageous or desirable properties of different homologues may 
5 be combined in a hybrid protein, such that the resultant 
expression product, with GalT function, may include fragments of 
various parent proteins, if appropriate. 

Similarity of amino acid sequences may be as defined, and 

10 determined by the TBLASTN program, of Altschul et ai. (1990) J. 
Mol. Biol. 215: 403-10, which is in standard use in the art. In 
particular, TBLASTN 2.0 may be used with Matrix BLOSUM62 and GAP 
penalties: existence: 11, extension: 1. Another standard program 
that may be used is -BestFit, which is part of the Wisconsin 

15 Package, Version 8, September 1994, .(Genetics Computer Group,' 
575 Science Drive, Madison, Wisconsin, USA, Wisconsin 53711) . 
BestFit makes an optimal alignment of the best segment of 
similarity between two sequences. Optimal alignments are found 
by inserting gaps to maximize, the number of matches using the. 

20 local homology algorithm of Smith and Waterman (Adv. Appl. Math. 
(1981) 2: 482-489). Other algorithms include GAP, which uses the 
Needleman and Wunsch algorithm to align two complete sequences 
that maximizes the number of matches and minimizes the number of 
gaps. As with any algorithm, generally the default parameters 

25 are used,' which for GAP are a gap creation penalty - 12 and gap 
-extension penalty = 4. Alternatively, a gap creation penalty of 
3 and gap extension penalty of 0.1 may be used. The algorithm 
FAST A (which uses the method of Pearson and Lipman (1988) PNAS 
USA 85: 2444-2448) is a further alternative. 

30 

Use of either of the terms "homology" and "homologous" herein- 
does not imply any necessary evolutionary relationship between 
compared sequences, in keeping for example with standard use of 
terms such as "homologous recombination" which merely requires 
35 .that two nucleotide sequences are sufficiently similar to 
recombine under the appropriate conditions. 
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It is to be understood that the teaching of all references cited 
herein is incorporated into the teaching of the specification. 

Examples 

5 

Methods and Materials 

Plant material 

The wild-type strain of Physcomitrella patens (Hedw.) B.S.G. 
10 characterised by Reski et al . (1994) Genome analysis of the moss 
Physcomitrella patens (Hedw.) B.S.G. . Mol Gen Genet 244, 352-359) was 
used. It is a subculture of strain 16/14 which was collected by 
H.L.K. Whitehouse in Gransden Wood, Huntingdonshire, UK and was 
propagated by Engel (1968) Am J Bot 55, 438-446.). 

15 

Construction of pRTlOlVEGF C3 

Human vascular endothelial growth factor 121 (VEGF121) cDNA . 
without leader sequence was excised as an Ndel-Sall fragment 
from pCYTEXP-VEGF 12 i (available from GBF, Braunschweig, Germany) . 

20 This fragment was blunt-ended by the Klenow reaction and 
introduced into pRTlOl (TSpfer et al. (1987) NAR 15, 589) at 
the Smal restriction site to form plasmid pRTlOlVEGF C3 . In this 
construct, the VEGF121 cDNA without leader sequence was placed 
downstream of the CaMV 35 S promoter and terminated by the CaMV 

25 terminator (Gorr (1999) Biotechnologische Nutzung von Physcomi- 
trella patens (Hedw.) B.S.G.. Ph.D. thesis, Hamburg University) . 

Construction of pRTlQlTPVEGF C3 

The sequence for VEGF signal peptide (sorting signal for 
30 secretion) was cloned into pRTlOlVEGF C3 . The signal peptide 
cDNA was amplified from the plasmid pRTlOl P21 (Gorr 
(1999) , supra) using the 5' primer MoB323 5 1 - ATA CTC GAG GAA GAT 
GAA CTT TTC TGC CTG TCT TGG -3 1 (SEQ ID NO 1) containing an Xhol 
restriction side and 3' primer MoB349 5 1 - CTG CCA TGG GTG CAG 
35 CCT GGG ACC AC -3 1 (SEQ ID NO 2) containing a Ncol restriction 
side. The amplified DNA was digested with Xhol and Ncol and 
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ligated into pRTlOlVEGF C3 {Xhol / Ncol digested) resulting in 
pRTlOlTPVEGF C3 . The resulting plasmid contained the coding 
sequences for the VEGF signal peptide and VEGF121 in frame under 
.the control of the CaMV 35 S promoter. 

5 

Standard culture conditions 

Plants were grown axenicallly under sterile conditions in plain 
inorganic liquid modified Knop medium (1000 mg/1 Ca(N0 3 )2 x 4H 2 0 
250 mg/1 KC1, 250 mg/r KH 2 P04, 250 mg/1 MgS0 4 x 7 H 2 0 and 12.5 

10 mg/1 FeS0 4 X 7 H 2 0; pH 5.8 (Reski and Abel (1985) Planta 165, 
354-358) . Plants were grown in 500 ml Erlenmeyer flasks contai- 
ning 200 ml of culture medium and flasks were shaken on a 
Certomat R shaker (B.Braun Biotech International, Germany) set 
at 120 rpm. Conditions in the growth chamber were 25 +/-. 3°C 

15 and a light-dark regime of 16:8 h. The flasks were illuminated 
from above by two fluorescent tubes (Osram L 58 W / 25) 
providing 35 micromols^m" 2 . The cultures were subcultured once a 
.week by disintegration using an Ultra-Turrax homogenizer (IKA, 
Staufen, Germany) and. inoculation of two new 500 ml Erlenmeyer 

20 flasks containing 100 ml fresh Knop medium. 

Protoplast isolation 

After filtration the moss protonemata were preincubated in 0.5 M 
mannitol. After 30 min, 4 % Driselase (Sigma, Deisenhofen, 

25 Germany) was added to the suspension. Driselase was dissolved in 
0.5 M mannitol (pH 5.6-5.8), centrifuged at 3600 rpm for 10 min 
and sterilised by passage through a 0.22 microra filter (Millex 
GP, Millipore Corporation, USA) . The suspension, containing 1% 
Driselase (final concentration), was incubated in the dark at RT 

30 and agitated gently (best yields of protoplasts were achieved 
after 2 hours of incubation) (Schaefer, "Principles and 
protocols for the moss Physcomitrella patens", (May 2001) 
.Institute of Ecology, Laboratory of Plant Cell Genetics, 
University of Lausanne Didier . Schaef er@ie-pc . unil . ch ) . The 

35 suspension was passed through sieves (Wilson, CLF, Germany) with 
pore sizes of 100 microm and 50 microm. The suspension was 
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centrifuged in sterile centrifuge tubes and protoplasts were 
.sedimented at RT for 10 min at 55 g (acceleration of 3; slow 
down at 3; Multifuge 3 S-R, Kendro, Germany) (Schaefer, supra) . 
Protoplasts were gently resuspended in W5 raedium (125 mM CaCl2 x 
5 2H 2 0; 137 mM NaCl; 5.5 mM glucose; 10 mM KC1; pH 5-6; 660-680 
mOsm; sterile filtered) . The suspension was centrifuged again at 
RT for 10 min at 55 g (acceleration of 3; slow down at 3; 
Multifuge 3 S-R, Kendro, Germany) . • Protoplasts were gently re- 
suspended in W5 medium (Rother et al . (1994) J Plant Physiol 
10 143, 72-77) . For counting protoplasts a small volume of the 
suspension was transferred to a Fuchs-Rosenthal-chamber . 

Transformation protocol . . 

For transformation protoplasts were incubated on ice in the dark 

15 for 30 minutes. Subsequently, protoplasts were sedimented by 
centrifugation at RT for 10 min at 55 g (acceleration of 3; 
slow down at 3; Multifuge 3 S-R, Kendro). Protoplasts were - 
resuspended in 3M medium (15 mM CaCl 2 x 2H 2 0; 0.1 % MES; 0.48 M 
mannitol; pH 5.6; 540 mOsm; sterile filtered, Schaefer et al. 

20 (1991) Mol Gen Genet 226, 418-424) at a concentration of 1.2 x 
10 6 protoplasts / ml (Reutter and Reski (1996) Production of a 
heterologous protein in bioreactor cultures of fully differen- 
tiated moss plants, PI. Tissue culture and Biotech., 2., pp. 142- 
147) . 250 microlitre of this protoplast suspension were 

25 . dispensed into a new sterile centrifuge tube, 50 microlitre DNA 
solution (column purified DNA in H 2 0 (Qiagen, Hilden, Germany.) ; 
10-100 microlitre; optimal DNA amount of 60 microgram) was added 
and finally 250 microlitre PEG-solution (40% PEG 4000; 0.4 M 
mannitol; 0.1 M Ca(N0 3 ) 2 ; pH 6 after autoclaving) was added. The 

30 suspension was immediately but gently mixed and then incubated 
for 6 min at RT with occasional gentle mixing. The suspension 
was diluted progressively by adding 1, 2, 3 and 4 ml of 3M 
medium. The suspension was centrifuged at 20°C for 10 minutes at 
55 g (acceleration of 3/ slow down at 3; Multifuge 3 S-R, 

35 Kendro) . 
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The pellet was re-suspended 

a) for transient transformation experiments in 300 microlitre or 
400 microlitre 3M medium. Cultivation of transformed protoplasts 
was performed in 96 well- plates (300 microlitre , Nunclon, Nunc 

5 GmbH, Wiesbaden , Germany) or 48 well plates (400 microlitre, 
Cellstar, greiner bio-one, Frickenhaus.en, Germany) ; 

b) for stable transformation in 3 ml regeneration medium 
(modified Knop medium; 5% glucose; 3% mannitol; 540 mOsm; pH 
5.6-5.8). Regeneration and selection was performed as described 

10 by Strepp et al. (1998) Proc Natl Acad Sci USA 95,4368-4373). 
For selection Knop f s media were supplemented- either with 50 
microgram/ml G418 or with 30 microgram/ml Hygromycin B. 

Co-transformation was performed by transferring 10 microgram 
15 undigested pRT99 (Topfer et al . (1988) Versatile cloning vectors, 
for transient gene expression and direct gene transfer in plant, 
cells. NAR 16, 8725) containing the npt II gene as a selection 
marker in parallel with 10 microgram linearised DNA of each 
knockout and/or integration construct into the protoplasts. 

20 

' MALDI-Tof MS of moss glycans 

Plant material was cultivated in liquid culture, isolated by 
filtration, frozen in liquid nitrogen and stored at -80 °C. 
Protein bands were excised from Coomassie-stained SDS- 
25 polyacrylamide gels. The material was shipped under dry ice. The 
MALDI-TOF MS analyses were done in the laboratory of * Prof. Dr. 
F. Altmann, Glycobiology Division, Institut ftir Chemie, 
Universitat fur Bodenkultur, Vienna, Austria. 

30 0.2 to 9 g fresh weight of Physcomitrella patens was digested 
with pepsin. N-glycans were obtained from the digest as 
described by Wilson et al. (2001). Essentially, the glycans were 
released by treatment with peptide :N-glycosidase A and analysed 
by MALDI-TOF mass spectrometry on a DYNAMO (Thermo BioAnalysis, 

35 Santa Fe, NM) . 
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Synthesis of oligonucleotides 

For design of synthetic oligonucleotides the standard code for 
nucleotides was used- For design of degenerated primers the 
standard code was used: 
5 R = A,G; Y =T,C; W = A, T; S = C,G; M - A,C; K = G, T; H - A,T,C; 
B = G, C,T; V = G,A,C; D = G,A,T; N = G,A,C,.T 

Examples 

10 1. Cloning of - the, 1 , 2-N-acetylqlucosaminyltransf erase I from 
Physcomitrella patens and analysis of the knockout plants 

1.1 Cloning of the cDNA and genomic DNA for GNTI 

A fragment of the cDNA for the GNTI was obtained by performing 
15 two rounds of PCR with degenerated primers on the cDNA from -P. 
patens WT. These primers were created on the basis of a protein 
alignment of the known GNTis from plants. In the first round 
primers GNT (d) 1 ( 5 1 -GTNGCNGCNGTNGTNGTNATGGC-3 1 , SEQ ID NO 3) and 
GTN (d) 3 (5 1 -CCYTTRTANGCNGCNC (TG) NGGNACNCC-3 1 , SEQ ID NO 4) were 
20 used and then the product from this PCR was subjected to 
subsequent PCR with the primers GTN (d) 2 (5 f -TAYAARATN (CA) GN- 
CAYTAYAARTGG-3 1 , SEQ ID NO 5) and GTN (d) 4 ( 5 1 -ARRTAYTGYTTRAARAA- 
YTGNCC-3', SEQ ID NO 6). The PCR resulted in a. 500bp product 
which was cloned into pCR 4 TOPO and sequenced from both ends. 

25 

To obtain the missing 5' -end of the cDNA 5* RACE was performed 
with the primers: 5RACEG3 (5 1 -GTCCGTGTCCAATAAAGGAG-3 1 , SEQ ID NO- 
7), 5RACEG4 ( 5 1 -GTCGGGAGAGATTTCCATGTC-3 1 , SEQ ID NO 8), 5RACEG5 
(5 1 -CTAAGATGACGACCCTTCGG-3 1 , SEQ ID NO 9) which gave an 

30 additional 300bp fragment of cDNA, however still without the 
initial Met. Therefore, on the basis of the new sequence another 
round of 5 1 RACE was performed with the primers: 5 RACE 6 (5 f ~ 
CATCCTGAGAAACAAAAAGTGG-3 1 , SEQ ID NO 10), 5 RACE 7 ( 5 1 -AGTTACAGA- 
CTTCAATGTACG-3 1 , SEQ ID NO 11), and 5RACE8 (5 1 -AATCAGGACGGTTG- 

35 CAAGCC-3 1 , SEQ ID NO 12) which gave an additional 400bp fragment 
containing the initiation codon. 
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To obtain the .missing 3 T -end of the cDNA 3' RACE was- performed 
correspondingly with the primers: 3RACEG1 (5 ? -TTATCCGACCTGAAG- 
• TTTGC-3 ' , SEQ- ID NO 13 ) , 3RACEG2 (5 1 -GACCTACAATTTTGGAGAGC-3 ' , 
SEQ ID NO 14) which resulted in a fragment about 450bp with the 
5 stop codo'n and finally the complete 141 6bp of the cDNA for GNTI 
• (GenBank: AJ429143) . 

The corresponding 5788bp of the genomic sequence was obtained by 
cloning the GNTI gene in three pieces by PCR on WT P. patens 

10 genomic DNA with the specific primers: GNT5F ( 5 1 -TGGGCTTTAACA- 
CAACTTTT-3 1 , SEQ ID NO 15) and GTN6R (5 1 -GCCCTAAGCTTGATCCCTG-3 1 , 
SEQ ID NO 16), GNT21F (5 1 -ATGGCAGATATGGCTCGATTG-3 1 , SEQ ID NO 
17) and 5RACEG5 (5 1 -CTAAGATGACGACCCTTCGG-3 ' , SEQ ID NO 9), 
3RACEG1 (5 f -TTATCCGACCTGAAGTTTGC-3 1 , SEQ ID NO 13) and GNT15R 

15 (5 1 -AGTTTCTATGGTATCTAACTGC-3 1 , SEQ ID NO 18) which were 
' sequenced by primer walking. 

1.2 Creating the knockout construct for GNTI 

20 To generate the knockout construct two fragments were 
synthesized by PCR on the genomic DNA with the primers for the 
5 ' -end of the construct : GNTHT7 (5 ' - GAGCATCCAAGCTTGACCTGG- 3 1 , 
SEQ ID NO 19) and GNTET7 (5 1 -GCACCGTGAATTCTTCTAGCTT-3 1 , SEQ ID 
NO 20) and for the 3 1 -end correspondingly GNTHT3 (5 ' -GGAAGAAC- 

25 AAGCTTCAAAGTGGC-3 f , SEQ . ID NO 21) and GNTPT3 ( 5 1 -GATCCCTGCAGAT- 
CTCAAACG-3 1 , SEQ ID NO 22). The thus obtained fragments of 469bp 
and 835bp of genomic DNA were first cloned into the TOPO-pCR 
plasmid (Invitrogen, USA) . The fragments were then cut out from 
this vector with ScoRI/Hindlll and Pstl/Hindlll, respectively 

30 and cloned into the pCRII vector digested with EcoRI and PstI : 
The resulting plasmid was digested with Hindlll and the nptll 
selection cassette was introduced, which was obtained by 
digestion with the same restriction enzyme from the vector 
pRTlOlneo (Girke et al. 1998). 

35 
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For the transformation the knockout plasmid was digested with 
EcoRI and Bcul restriction enzymes and 30microgram were used for 
the trans formation . 

5 1.3 Pre-screeninq of the transgenic plants 

For the pre-screening of the resistant plants, small ' pieces of 
gametophores (l-5mg) were treated for 30 min at 45°C in 75mM 
' Tris-HCl, pH 8, containing 20 mM (NH 4 .)2S0 4 and 0.1% Tween 20, and 
3 microlitre .of this extract was used for PCR with the following 

10 four pairs of primers : GNT7F (5 1 -GTTCSATGGTTTGAGCAGG-3 1 , SEQ ID 
NO 23) and GNT8R ( 5 1 -GCGACCTTTCCTATTCTCC-3 ? , SEQ ID NO 24) to 
detect a disruption of the original gntJ gene, Nl (5 1 -TACCGAC-: 
AGTGGTCCCAAAG-3 ' , SEQ ID NO 25) and N2 (5 1 -CCACCATGATATTCGGCAAG- 
3', SEQ ID NO 26) to detect the ' presence of the nptll cassette, 

15 GNT5F (5 1 -TGGGCTTTAACACAACTTTT-3 1 , SEQ ID NO 27) and N3 (5 1 - 
TGTCGTGCTCCACCATGTT-3 1 , SEQ ID NO 28) to control the integration 
of the transgene at the 5' end, and N4 { 5 1 -GTTGAGCATATAAGAAAC- 
3', SEQ ID NO 29) and GNT10R (5 ' -CACATTGTTCAATTTGATAGAC-3 1 , SEQ 
ID NO 30) to control the integration at the 3' end. Plants that 

20 gave the expected fragments in all four PCR reactions were 
considered as putative knockouts and selected for further 
analysis. 

Finally 9 transgenic plants were chosen for further molecular 
25 and biochemical analysis. 

1.4 Northern analysis 

Total RNA was isolated from moss tissue by RNeasy kit (Qiagen, 
Germany) . For northern analysis, 5 microgram of total RNA were 

30 subjected to electrophoresis on f ormaldehyde-agarose gels, at 
120V, transferred to Hybond-N nylon membranes (Amersham 
Biosciences), and hybridized with a 32 P-labe.led cDNA probe 
corresponding to 3* gntJ cDNA including part of the homologous 
region used for the knock out and part of the cDNA not present 

35 on the knockout construct The" membranes were washed four times 
at different concentrations of SSC (the final washing step being 
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0.5xSSC, 0.1% SDS at 65 °C) and exposed to x-ray film (Kodak Bio- 
Max MS) at -80 °C for 2 to 3 days. 

Northern analysis with the probe [obtained by PCR on the cDNA 
5 with primers: 3RACBG1 (5 1 -TTATCCGACCTGAAGTTTGC-3 ' , SEQ ID NO 13) 
and GNT15R (5 1 - AGTT T C TAT GGT ATC TAACTGC - 3 f , SEQ ID NO 18) ] 
detected the correct- GNTI transcript solely in the wild type.. 
The lack of the correct transcript in the knockout plants 
-corroborates the disruption of the gntl gene in P. patens . 

10 

The expression of the control gene L21 was not affected in the 
■ transformants. 

1.5. Southern analysis of the transgenic plants 

15 For Southern blot analysis 5 microgram of genomic DNA were used. 
For the detection of the number of integration sites for the 
nptll casette the genomic DNA was digested with PvuII which cuts 
nptll cassette into two fragments and hybridized with the nptJI 
cassette as a probe. This analysis showed that some plants have 

20 very high numbers of integrated selection cassettes whereas 
plant No. 6 has a single integration event. Plants No. 4 and 8 
have also a low number of integrations: between 4-5 per plant. 

In a second Southern analysis, genomic DNA was digested with 
25 EcoRI and hybridized with gntJ probe located on the 3 1 end of 
the gene, out of the region which was used for creating the 
knockout construct. In this case it was expected that transgenic 
plants will have a band which is larger than WT for the size of 
the nptll casette (1500bp) . WT have single signal at 4500bp 
30 whereas transgenic plants have the bands of 6000bp or other but 
do not contain the WT band, confirming disruption of the WT 
locus . 



35 
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1.. 6 MALDI-TOF mass spectrometry 

The N-glycans of Physcomitrella patens WT exhibit the typical 
structural features of plant N-glycans as described in e.g. 
Wilson et al. (2001) Glycobiology 11, 261-274). I.e. fucose in 
5 alpha 1,3-linkage to the Asn-bound GlcNAc, xylose in beta 1,2- 
linkage to the beta mannosyl reidue, Lewis A epitopes (alpha 
1,4-fucosyl and beta 1, 3-galactosyl residues linked to GlcNAc) 
as non reducing terminal elements (tab. 1) . Three GNTi knockout 
plants were analysed. The N-glycans of GNTI knockout plants 
10 exhibited the same structures compared to WT (tab. 1) confirming 
that the knockout was only successful on the "molecular but riot 
on the biochemical level. Therefore, it is assumed that another 
GNTI exists in Physcomitrella patens. 

15 2. Cloning of the alpha 1, 3-f ucosyltransf erase from P. patens 
and analysis of the knockout plants 

2.1. Cloning of the cDNA and genomic DNA for alpha 1,3- 
fucosyltransf erase (alpha 1,3-FT) 

20 Part of the cDNA for the alpha 1,3-FT was obtained by performing 
two rounds of PCR with degenerated primers on the cDNA from P. 
patens WT. These primers were created on the basis of a protein 
alignment of the known alpha 1,3-FTs from Vigna radiata and 
Arabidopsis thaliana . In the first round primers FD4F (5 1 - 

25 TGGGCNGARTAYGAYATGATG-3 ' , SEQ ID NO 31) and FDRl ( 5 1 -TGNGTNA- 
RNCCNADNGGRTADAT-3 1 , SEQ ID NO 32) were used and then the 
product from this PCR was subjected to a subsequent PCR with the 
primers FD4F and FD5R ( 5 f -TGNACNGCNGCCATRTC-3 1 , SEQ ID NO 33). 
The second PCR resulting in a 510bp product which was cloned 

30 into pCR 4 TOPO and sequenced from both ends . 

To obtain the missing 5 '-end of the cDNA, 5' RACE was performed 
with the primers: 5FT4 (5 ' -GTAACATTCGCATAATGG-3 1 , SEQ ID NO 34), 
5FT5 (S'-CGATCATTATGCGCACCAC-S 1 , SEQ ID NO 35), and 5FT6 (5 1 - 
35 GGAAATAAAAGCAGCTCC-3 ? , SEQ ID NO 36) which gave an additional 
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200bp of cDNA, however still without the initial Met. Therefore, 
on the base of the new sequence a second round of 5 1 RACE was 
performed with the primers: 5FT7 (5 1 -AGGGTGAATCTCCATAGCC-3 1 , SEQ 
ID NO 37), 5FT8 ( 5 ' -CATCTGCCTGACCCTCACC-3 1 , SEQ ID NO 38), and 
5 5FT9 (5 1 -GCCTTGAACACGCATGGC-3 1 , SEQ ID NO 39) which gave an 
additional 15 0bp fragment but still without the initiation 
codon. Finally, a third round was performed with the primers: 
5FT9, 5FT10 (5 1 tCGATACAACCAGCACAGG-3 1 , SEQ ID NO 40), and 5FT11 
(5 1 -CTTCTCTAGCCATTCTGCC-3 1 , SEQ ID NO- 41), which gave an 
"10 additional 400bp of sequence containing the initiation codon. 

To obtain missing 3 1 -end of the cDNA 3' RACE was performed 
correspondingly with the primers: 3FT1 (5 1 -GCAGTGGAAGTTTAATGGTC- 
3', SEQ ID NO 42) and 3FT2 (5 1 -TCGTTTCTAGCTCTAGTAGAC-3 1 , SEQ ID 
15 NO 43) which resulted in a fragment of about 550bp with the stop 
codon and finally the complete 1711bp of the cDNA for the alpha 
1,3-FT (GenBank: partial cDNA: AJ429145) . 

The corresponding 3083bp of the genomic sequence was obtained by 
20 cloning the alpha 1,3-FT gene in three . pieces by PCR from WT 
P. patens genomic DNA with the specific primers: FTA9F (5 f - 
ATGCTCCCAGCCCAAGAC-3 1 , SEQ ID NO 44) and FTA10R ( 5 1 -TGTCTACT- 
AGAGC T AGAAACG - 3 1 , SEQ ID NO 45 ) , FT18F (5 1 -TAGGGAGTAAATATG- 
AAGGG-3 1 , SEQ ID NO 46) and 5FT5 (5 ' -CGATCATTATGCGCACCAC-3 1 , SEQ 
25 ID NO 35), 3FT1 (5 ' -GCAGTGGAAGTTTAATGGTC-3 1 > SEQ ID NO 42) and 
FTA12R (5 1 -TACTTCCAATTGAAGACAAGG-3 ! , SEQ ID. NO 47) which were 
sequenced by primer walking. 

2,2. Creating the knockout construct for alpha 1,3-FT . 
30 To create the knockout construct, PCR on the genomic DNA was 
performed with the primers: FT15F (5 T -AATGTTCTGTGCCATGCG-3 1 , SEQ 
ID NO 48) and FT16R ( 5 1 -TGCTTCAAATGGGCTAGGG-3 1 , SEQ ID NO 49). 
The thus obtained 2156bp fragment was cloned into the pCR4 TOPO 
(Invitrogen, USA) vector. The Nptll cassette was synthesized by 
35 PCR with a proof reading polymerase on the pRTlOlneo plasmid 
(Girke et al. 1998) with the primers: nptll/Ndel-F <5'- 
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ATGCCATATGGCATGCCTGCAGGTCAAC-3 ' , ■ SEQ ID NO 50) to generate a 
. Ndel. restriction site and nptII/BstZ17I-R ( 5 \ -GCATGTATACGCAT- 
GCCTGCAGGTCACTG-3 1 , SEQ ID NO 51) to generate a BstZllI site. 
The PCR product was. also cloned into pCR4 TOPO ( Invitrogen,- 
5 USA) . Both plasmids were digested with Ndel and Bs tZ17 1 
restriction enzymes. By this restriction a 195bp fragment 
containing a part of the fourth intron and a part of the fifth 
. exon was cut out from the alpha 1,3-FT genomic fragment and 
replaced by the nptll cassette. 

10 

For the transformation of P. pat&ns resulting the knockout 
construct was digested with NotI and Mssl restriction enzymes 
and 25 microgram were used for transformation. 

15 2.3. Pre-screening of the transgenic plants 

For the pre-screening of the resistant plants, small pieces of 
gametophores (l-5mg) were treated for 30 min at 45 °C in 75mM 
Tris-HCl, pH 8 containing 20 mM (NH 4 ) 2 S0 4 and 0.1% Tween 20, and 
3 microlitre of this extract was used for PCR with the following 

20 four pairs of primers : FT14F (5 1 -ACAAAGTTACATACTCGCG-3 1 , SEQ ID 
NO 52) and; FTA12R .( 5 1 -TACTTCCAATTGAAGACAAGG-3 1 , SEQ ID NO 47) to. 
detect a disruption of the original ft gene, Nl (5. 1 -TACCGACAG- 
TGGTCCCAAAG-3 1 , SEQ ID NO 25) and N2 (5 1 -CCACCATGATATTCGGCAAG- 
3 1 , SEQ ID NO 2 6) to detect the presence of the nptll cassette, 

25 FT14F (5 1 -ACAAAGTTACATACTCGCG-3 1 , SEQ ID NO 52) and N3 (5 1 - 
TGTCGTGCTCCACCATGTT-3 1 , SEQ ID NO 28) to control the integration 
- of the transgene at the 5' end, FTA12R (5 1 -TACTTCCAATTGAA- 
GACAAGG- 3 1 , SEQ ID NO 47) and N4 (5 1 -GTTGAGCATATAAGAAAC-3 1 , SEQ 
ID NO 29) to control the integration at the 3 1 end. Plants that 

30 gave the expected fragments by all four PCR reactions were 
considered as putative knockouts and selected for further 
analysis . 

Finally 9 plants were chosen for further molecular and 
35 biochemical analysis . 



WO 2004/057002 PCTYEP2003/014576 

34 

2.4. RT-PCR 

Total RNA was isolated from moss tissue by RNeasy kit (Qiagen, 
Germany) . Reverse transcription PCR was performed according to 
standard protocol. For the RT-PCR primers FTA9F ( 5 1 -ATGCTCCC- 
5 AGCCCAAGAC-3 1 , SEQ ID NO 44) and FTA10R ( 5 1 -TGTCTACTAGAGCTAGA- 
AACG-3 1 , SEQ ID NO 45) located in the central region of the cDNA 
for alpha 1,3-FT were used. Using these primers a 475bp 
transcript, was detected only in the WT whereas all transgenic 
plants did not. give any PCR products. The absence of the 
10 detectable transcript with the primers located on both sides of 
integrated nptll cassette confirms that all plants analysed are 
knockouts. 

As a control RT-PCR was performed with primers for APS reductase 
(Koprivova et al. (2002) J. Biol. Chem. 277, 32195-32201): R10 
15 (5 1 -TCTTTCACTATTCGGTGACG-3 1 , SEQ ID NO 53) and Rll <5'~ 
CGACCACAACATTAGATCC-3 1 , SEQ ID NO 54), which amplified a 900bp 
fragment from all plants . 

2.5 MALDI-TOF mass spectrometry 

20 The N-glycans of Physcomitrella patens WT exhibit the typical 
structural features of plant N-glycans as described in e.g. 
Wilson et al . (2001) , supra. I.E. fucose in alpha 1,3-linkage to 
the Asn-bound GlcNAc, xylose in beta 1,2-linkage to the beta, 
mannosyl reidue, Lewis A epitopes (alpha 1,4-fucosyl and beta 

25 1, 3-galactosyl residues linked to. GlcNAc) as non reducing 
terminal elements (tab. 1). Three alpha 1,3 FT knockout plants 
were analysed. No alpha 1,3 fucose residues linked to the Asn- 
bound GlcNAc could be detected on the N-glycans of alpha 1,3 FT 
knockout plants (tab. 1) confirming that the knockout of alpha 

30 1,3 fucosyltransf erase in Physcomitrella patens was completely 
successful. 
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2A V Cloning of the alpha 1, 3-f ucosyltransf erase flanking regions 
from P. patens and construction of the knockout constructs 

2A.1. Cloning of flanking DNA corresponding to the alpha 1,3- 
5 fucosyltransf erase (alpha 1,3-FT) gene 

The 3 '-flanking DNA corresponding to the alpha 1, 3-fucosyltrans- 
ferase gene of Physcomitrella patens was amplified by PCR from 
genomic DNA using primer MOB558 (5 1 -GTTCCGCGGTGATCCCGTTTTCATATCA- 
GTGTATT-3 1 , SEQ ID NO 84) containing a Sacll restriction site 

■ 10 and primer MOB557 ( 5 1 -TTTGAGCTCTACGTAACAATAACATAAAATATCACA-3 f , 
SEQ ID- NO 8-5) containing a SnaBX and a Sacl site. The PCR 
product ( 5 ' - GTTCCGCGGTGATCCCGTTTTCATATCAGTGTATTATCATCAGTGACTGC- 
ATATTGACACCCAATTCTGATGATTTTTTATTTTTTATTTTTTATTTTTTTTGGTATGGTTACA 
TGCTTTTCAGAGGTTTCTATGCCGCTGAGTATTTTCCTGAATCGCGAGGTGTGACAGGTTATCT 

15 GCGCCGTCCACCCAATATTTTATGATGAGTCGATGATTCGTGAGACTAATCTAGCTTAACCTTT 
TTCTTACTGGCAAGTCAAAATTGAGTTTAAAATATTTCAGTATCCTGTTAGTAATTTCAGACAC 
ATGTATTCTATGTCTCATACTCTTTACGTGAAAGTTCAACTGACTTATATTTTGTCGTTTTTCT 
GTAGATCACTGTTTTAGCGCATACAAAGACAATTGTCTAAATATTTTTAAAGAAGGTGATATTT 
T ATT AT AAGAT AGAAGT C AAT AT-GTTT TT TT GT T ATGC ACATGACT TGAAT AAAAT AAATTTTT 

20 TTGTTAGATTTAAATACTTTTTGAATTATAGCTTTGTTGAAATTAAGGAATTTATATTCATAAG 
AAGCTACTCGAACAAATTTACAAAGAGAACATTTGATAAGTAAAAGTAATTAAAAGTTTTTTTT 
AATTTAAAAAGATTAATTTTTATTAATAAGAAGAACTTGGAAAGTTAGAAAAATATTTAACTTT 
AAAAATTAAGAAAACAAGGCAAAACTTTAATTTACAAATACTTAATGTAGATTAATTTTCTTAT 
TATATATTAGCACAAATTATCATTATGTGATATTTTATGTTATTGTTACGTAGAGCTCAAA 

25 3 f , SEQ ID NO 86) was digested with Sad and Sacll and cloned 
into the plasmid pBS (Sacl/ Sacll digested) . 

The 5 '-flanking DNA corresponding to the alpha 1, 3-fucosyl trans- 
ferase gene of Physcomitrella patens was amplified by PCR from 

30 genomic DNA using primer MoB555 (5 1 -CGCGTTAACTCTCTCTATCTCTC- 
TCTGTGTTGCG-3 1 , SEQ ID NO 87) containing a Hindi restriction 
site and primer MoB55 6 ( 5 1 -CGAGAATTCTCACTTAGAAGAAGCCCAATCCT-3 1 , 
SEQ ID NO 88) containing an EcoRI site. The PCR product (5 f - 
CGCGTTAACTCTCTCTATCTCTCTCTGTGTTGCGTTTGATCAGGGGTTTTAGGGTTTGGGTCCA 

35 GGGTTCCGAGGAGTATCGTCACGTGTATTGCGGTCTTGTTGGAGATTCCTCAGTTGTGCATGTA 
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GATATAAACTTAGTTTAGTCCACGATCGGT-TTCTAATCGTGGATTTTTGTGGGTTtCGGTCGTT - 
GAGCAAGAATTTTGTGAATTTTTTGTATTGGGGGAAGGAAATGGGGTTATGGCGATATCGTTTT 
CGTTGGGTTCAACGTGATCGGTGAGCTCCAGGAAGGGCTGGTCACTCACAATCCGGTATTCGTC 
TCATCGAGACGCATTTATCGGTTCATTATATGTATATATATATATATATATATATGCAGAGTCG 
5 ATTGTGTTGCAATTTCTGAACTAGGTACTGTTGAATTGTAGATTGCCTTCAAGTAGCTCTCGAT 
GTTGGAATGACGSACACAAATTCTGCTACTGAATGAGACCATATTCTGCACCGTTAATTGGTTT 
TATGAATATATGGTGTCGAATTACATTCTGTCTCGAATCCATGCGCCCTTTCTGCACGAACGTT 
GGTTTGTAGTTGTAGTGCAGCCAGTGTGTTTGGTTTAGGATTATGCTTTGACGATCGATGAGTC 
CGTTTCATGGTTTTATACTTGTCATTTATCTTCTTGTGATTTTTTGTTTACAAATGTTCCCCCA 

10 ATTGTAACGTGGGACTTTCGTGTGTGGTGGTTGCTCAAATTGATAGTTTTGGTCATTTGATTTG 
CGGAGAGCAATCGGTGTCATGGAAAATCCCTTCGACTGCTTTGATCCAATCAAAGTTCTGCTTG 
AGCCAATGTGAGAGGTGGAGGATTGGGCTTCTTCTAAGTGAGAATTCTCG -3 1 , SEQ ID . 
NO 89) was digested with Hindi and EcoRI and cloned into the 
plasmid pBS containing the 3 '-flanking DNA corresponding to the 

15 alpha 1 , 3-f ucosyltransf erase gene of Physcomitrella patens 
(HincII/EcoRI digested) . 

2A.2. Creating the knockout construct for the replacement of the 
alpha 1,3-FT gene 

20 To create the knockout construct a expression cassette 
containing the 35 S promoter, the 297 bp ftsZl cDNA fragment 
coding for the FtsZl chloroplast transit peptide of 
Physcomitrella patens, the coding sequence for GFP and the nos 
terminator (Kiessling et al. (2000) J. Cell Biol. 151, 945-950) 

25 was used. The expression cassette was subcloned into pZERO by 
digestion of p FtsZl ( 1-93 ) -GFP with Hindlll and EcoRI and cloning 
into Hindi 1 1 /.EcoRI digested pZERO, resulting in the plasmid 
pZEROftszGFP. The expression cassette was cut out . from 
pZEROftszGFP by digestion with ffindlll, blunting by incubation 

30 with Klenow fragment and subsequently digestion with EcoRI. To 
create the knockout construct, the digested expression cassette 
was cloned into the vector described in 2A.1 which was digested 
with Smal and EcoRI . 

35 The final plasmid was digested with Kpn I/Sna BI resulting in 
the linearised DNA containing the 3' -flanking DNA corresponding 
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to the alpha 1, 3-fucosyltransf erase gene of Physcomitrella 
patens, TPftsZGFP fusion gene under the control of the 35.S 
promoter and the nos terminator and the 5 '-flanking DNA. 
corresponding to the alpha 1, 3-fucosyltransf erase gene of 
5 Physcomitrella patens. This linearised plasmid was used for the 
knockout of the alpha 1, 3-fucosyltransf erase gene of 
Physcomitrella patens. 

2A.3. Generating plants with alpha 1,3 fucosyltransf erase and 
10 beta 1,2 xylosyltransf erase knockouts 

For generating plants without alpha 1,3-fucosyl and beta 1,2- 
xylosyl residues protoplasts derived of Physcomitrella patens 
protonema were transformed with the. constructs described in 
chapters 2A and 3. Transformation could be performed 

15 subsequently as well as co-transformation with both constructs. 
In this case the knockout of the alpha 1 , 3-fucosyltransf erase 
gene was performed not only by a gene disruption but by a gene 
replacement. As an additional selection marker in the subsequent 
transformation procedure a plasmid containing the gene 

20 responsible for hygromycin resistance (pCambial305) was used for 
cotransformation. 

2A.4 MALDI-TOF mass spectrometry 

The N-glycans of Physcomitrella patens WT exhibit the typical 
25 structural features of plant N-glycans as described in e.g. 
Wilson et al. (2001)., supra. I.e. fucose in alpha 1,3-linkage to 
the Asn-bound " GlcNAc, xylose in beta 1,2-linkage to the beta 
mannosyl residue, Lewis A epitopes (alpha 1,4-fucosyl and beta 
1, 3-galactosyl residues linked to GlcNAc) as non reducing 
30 terminal elements (Tab. 1) . One transgenic plant was analysed. 
No alpha 1,3 fucosyl residues linked to the Asn-bound GlcNAc nor 
beta 1,2-xylosyl residues could be detected on the N-glycans of 
these plants. 
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3, Cloning of the beta 1, 2-xylosyltransf erase from P. patens and 
analysis of the knockout plants 

3.1. Cloning of the cDNA and genomic DNA for beta 1,2-xylosyl- 
5 transferase (beta 1,2 XT) 

Part of the cDNA for the. beta 1,2-XT was obtained by performing 
two rounds of PCR with degenerated primers on the cDNA from P. 
patens WT. These primers were created on the basis of a protein, 
alignment of the known betal, 2-XTs. from plants. In the first 

10 round primers XDFl (5 1 -TGYGARGSNTAYTTYGGNAAYGG-3 1 , SEQ ID NO 55) 
and XDR1 (5 ' -GCNCKNAYCATYTCNCCRAAYTC-3 1 , SEQ ID NO 56) were used 
and then the product from this PCR was subjected to a subsequent 
PCR with the primers XDF2 ( 5 1 -GGNGGNGARAARYTNGARRANGT- 3 1 , SEQ ID 
NO 57) and XDRl . The second PCR gave a 610bp product which, was 

15 cloned into pCR 4 TOPO (Invitrogen, USA) and sequenced from both 
ends - 

To obtain the missing 5 '-end of the cDNA, 5 f RACE was performed 
with the primers: 5XT1 (5 1 -TCCTCCTTCTCTGGGACC-3 T , SEQ ID NO 58), 

20 5XT2 (5 1 -AGCTCCAGTTGTGAAATATGG-3 1 , SEQ ID NO 59) and 5XT3 (5 1 - 
TTCTTCCTCATTTCGTCCC-3 1 , SEQ ID NO 83) which gave an additional 
360bp of cDNA f however still without the initial Met. Therefore, 
on the basis of the new sequence a second round of 5 ! RACE was 
performed with the primers: 5XT4 (5 ' -CTTCCTTCACCACACTAC-3 ' , SEQ 

25 ID NO 60), 5XT5 • (5 V-TAGCATGACTGTGTGGCC-3 1 , SEQ. ID NO 61) and 
5XT6 (5 1 -AAAGGCTTGAGTGTAGCC-3 1 , SEQ ID NO 62) which gave an 
additional 390bp fragment containing the initiation codon. 

To obtain missing 3'-end of the cDNA 3' .RACE was ' performed 
30 correspondingly with the primers: 3XT1 ( 5 1 -GCCTTTCTTGCACGGGTTG- 
3', SEQ ID NO 63) and 3XT2 (5 1 -GGACATTCCAAATAATCCC-3 T , SEQ ID NO 
64) which resulted in a fragment of about 94 0bp with the stop 
codon and finally the complete 2300bp of the cDNA for beta 1,2- 
XT (GenBank: 1788 bp corresponding to the coding region: AJ 
35 429144), The 2388bp of the genomic sequence was obtained by 
. cloning the beta 1,2-XT gene in three pieces by PCR on WT P. 
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patens genomic DNA with the specific primers: XT14F (5 1 - 
TTACGAAGCACACCATGC-3 1 , ■ SEQ ID NO 82) and XT15R (5 1 -GTCCTGTTAA- 
ATGCCTTGC-3 1 , SEQ ID NO 65), XT-M1F (5 1 -AGGTTGAGCAATCATATGGC-3 1 , 
SEQ ID NO 66) and 5XT2, 3XT1 and XT11R (5 1 -ATCCCAGAAATATCTGATCC- 
5 3 1 , SEQ ID NO 67) which were sequenced by primer walking. 

3.2. Creating the knockout construct for beta 1,2-XT 

To create the knockout construct PCR on the genomic DNA was 
performed with the primers: XT12F ( 5 1 -TGTGAGGCGTTCTTTGGC-3 1 , SEQ 

10 ID NO 68) and XT11R ( 5 1 -ATCCCAGAAATATCTGATCC-3 1 , SEQ ID NO 67). 
The - thus obtained fragment of 2066bp fragment was cloned into 
pCR4 TOPO vector (Invitrogen, USA) . The nptll cassette was 
synthesized by PCR on the pRTlOlneo plasmid (Girke et al. 1998) 
by proof reading polymerase with the primers: nptll/Sall-F (5 1 - 

15 ATGCGTCGACGTCAACATGGTGGAGCACG-3 1 . SEQ ID NO 69) . for creating 
. Sail restriction site and nptll/Ndel-R ( 5 1 -GCATCATATGTCACTGGAT- 
TTTGGTTTTAGG-3 1 , SEQ ID NO 70) for creating Ndel site. The PCR 
product was also cloned into pCR4 TOPO (Invitrogen, USA) . Both 
plasmids were digested with Ndel and Sail restriction enzymes. 

20 From the plasmid with genomic- DNA a 380bp fragment containing 
part of the second exon and part of the second intron was -cut 
out and replaced by the nptll cassette was introduced. 

For the transformation of P. patens the resulting knockout 
25 construct was digested with NotI and Spel restriction enzymes 
and 25 microgramm were used for transformation. 

3.3. Pre~screening. of the transgenic plants 

For the pre-screening of the resistant plants, small pieces of 
30 gametophores (l-5mg) were treated for 30 min at 45 °C in 75mM 
Tris-HCl, pH 8 containing 20 mM (NH 4 ) 2 S0 4 and 0.1% Tween 20, .and 
3 microlitre of this extract was used for PCR with the following 
four pairs of primers : XT-M1F (5 1 -AGGTTGAGCAATCATATGGC-3 1 , SEQ 
ID NO 66) and XT13R (5 1 -ACGATCCAAAATCTGGACGC-3 f , SEQ ID NO 71) 
35 to detect a disruption of the original xt gene. Nl (5 1 - 
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TACCGACAGTGGTCCCAAAG-3 1 , SEQ ID NO 25) and N2 (5 f -CCACCATG- 
ATATTCGGCAAG-3 1 , SEQ ID NO 26) to detect the presence of the 
nptll cassette, XT-M1F (5 1 -AGGTTGAGCAATCATATGGC-3 1 , SEQ ID NO 
66) and N3 ( 5 1 -TGTCGTGCTCCACCATGTT-3 ? , SEQ ID NO 28) to control 
5 the integration of the- transgene at the 5 r end, XT13R {5 1 - 
ACGATCCAAAATCTGGACGC-3 1 , SEQ ID NO 71) and N4 (5 1 -GTTGAGCATA- 
TAAGAAAC-3 1 , SEQ ID NO 29) and to control the integration at the 
3' end. Plants that gave the expected fragments in all four PCR 
reactions were considered as putative knockouts and selected for 
10 further analysis. 

Finally 9 plants were . chosen for further molecular and 
biochemical analysis. 

15 3,4. RT-PCR 

Total RNA was isolated from moss tissue by RNeasy kit (Qiagen, 
Germany) . Reverse transcription PCR was performed according to 
standard protocol. For . the RT-PCR primers XT 1 4 F (5 1 -TTACGAA- 
' GCACACCATGC-3 1 , SEQ ID NO 82) and XT15R (5 1 -GTCCTGTTAAATGCCTGC- 

20 3', SEQ ID NO 65) located in the central region of cDNA for beta 
1,2-XT and on both sides of npt II cassette were used. Using 
these primers- a . 290bp transcript was detected only in the WT 
whereas all transgenic plants did not give any PCR products. The 
absence of the transcript with the primers located on both sides 

25 of the integrated npt I I cassette confirms that all plants 
analysed are knockouts. 

As a control RT-PCR was performed, with primers for APS reductase 
(Koprivova et'al. 2002), supra): R10 (5 1 -TCTTTCACTATTCGGTGACG- 
30 3', SEQ ID NO 53) and Rll ( 5 1 -CGACCACAACATTAGATCC-3 1 , SEQ ID NO 
54), which amplified a 900bp fragment from all plants. 



35 
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3,5 MALDI-TOF mass spectrometry 

The N-glycans of Physcomitrella patens WT exhibit the typical 
structural features of plant N-glycans as described in e.g. 
Wilson et al . (2001), supra). I.e. fucose in alpha 1,3-linkage 
5 to the Asn-bound GlcNAc, xylose in beta 1,2-linkage to the beta 
mannosyl reidue, Lewis A epitopes (alpha 1,4-fucosyl and beta 

1. 3- galactosyl residues linked to GlcNAc) as non reducing 
terminal elements (tab. 1). Three beta 1,2 XT knockout plants 
were analysed. No beta- 1,2 xylosyl residues linked to the beta- 

10 mannosyl residue could be detected on the N-glycans of beta 1,2 
XT knockout plants (tab. 1) confirming that the knockout of beta 
1,2 xylosyltransf erase in Physcomitrella patens was completely 
successful. 

15 4. Cloning of the cDNA for human beta-1, 4-galactosyltransf erase 

cDNA from human liver (Invitrogen, USA) was used for isolation 
of beta 1, 4-galactosyltransferase (GalT, GeriBank: X55415) cDNA. 
By using 5/ primer GalTXh-F (5'- TTCTCGAGACAATGAGGCTTCGGG- 
AGCCGCTC -3', SEQ ID NO 72) containing a ' Xho I restriction site 

20 and 3 1 primer GalTXb-R (5 1 -GGTCTAGACTAGCTCGGTGTCCCGATGTCC-3 1 , 
SEQ ID NO 73) containing a Xba I restriction site for PGR with 
Elongase enzyme mix (Invitrogen, USA) a 1.2 kb DNA fragment was 
amplified. The 1.2 kb fragment was cloned into pCR4-TOPO 
(Invitrogen, USA) . The fragment was cut out from this vector 

25 with Xho I and Xba I and cloned into pRTlOl (TSpfer et al . 
(1987) NAR 15 , 5890) digested with Xho I and Xba I. The 
resulting plasmid pRTlOl-GalT contained the cDNA of human beta 

1. 4- galactosyltransferase under the control of CaMV 35S promoter 
and CaMV terminator. 

30 

5. Creating the knockout construct for beta 1, 2-xylosyltrans- 
ferase and integration of human beta 1, 4-galactosyltrans-f erase 
From genomic DNA of Physcomitrella patens a 1.56 kb fragment of 
the beta 1, 2-xylosyltransf erase gene was amplified, by using 
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primer XTB-F ( 5 1 - T TGG AT CCT C AAT T AC G AAGC AC ACC AT GC - 3 1 , SEQ ID NO 
74) and primer' XTB-R (5 1 -TTGGATCCTCCTCCCAGAAACATCTGATCCAG-3 1 , 
SEQ ID NO 75) , both primers introduced Bam HI restriction sites. 
The amplification product was cloned into pCR4-TOPO (Invitrogen, 
5' USA) . The cloned beta - 1, 2-xylosyltransf erase gene fragment 
contained a single Hind III restriction site. In this Hind III 
restriction site the cDNA of beta 1, 4-galactosyltransf erase 
under the control of CaMV 35S promoter and CaMV terminator was 
introduced by ligation, resulting in the plasmid pCR4-XTko- 

10 GalTki. Digestion of pCR4-XTko-GalTki with Bam HI resulted in a 
fragment which contained the cDNA of beta 1, 4-galactosyl- 
transferase under the control of CaMV 35S promoter and CaMV 
terminator flanked 5' and 3' by sequences homolog to the beta 
1, 2-xylosyltransferase gene of- Physcbmitrella patens. This 

15 fragment was used ■ for knock out experiments. 

5.1 PCR 

For PCR analyses genomic DNA isolated from the putative knockout 
plants was used. The primers MoB 521 (5 1 -TTGCCGCTATCTACTTGT- 

20 ATGCTAACGT-3 1 , SEQ ID NO 76) and MoB 575 ( 5 1 -TGCCGTGGATGTGCTAG- 
ATAATCTT-3 T , SEQ ID NO 77) were located on the sequences 
homologous to the beta 1, 2-xylosyltransf erase on both sides of 
the beta 1, 4-galactosyltransf erase cassette. A fragment of 339 
bp corresponding to the expected beta 1, 2-xylosyltransf erase 

25 sequence was amplified from WT. From knockout plants a 2.1 kbp 
fragment corresponding to the introduced beta 1, 4-galactosyl- 
transferase cassette was amplified and no 339 bp fragment could 
be detected, confirming the knockout of beta 1, 2-xylosyl- 
transferase as well as the integration of the beta 1,4-galacto- 

30 syltransferase cassette. 



5.2 MALDI-TOF mass spectrometry 

The N-glycans .of Physcomitrella patens WT exhibit the * typical 
structural features of plant N-giycans as described in e.g. 
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Wilson et al . (2001), supra. I.e. fucose in alpha 1,3-JLinkage to 
the Asn-bound GlcNAc, xylose in beta 1,2-linkage to the beta 
mannosyl reidue, Lewis A epitopes (alpha 1,4-fucosyl and beta 
1, 3-galactosyl residues linked to GlcNAc) as non reducing 
5 terminal elements (tab. 1). Three beta 1,2 XT knockout, be.ta 
■ 1,4 GT integration plants were analysed- No beta 1,2 xylosyl 
residues linked to the beta-mannosyl residue could be detected 
on the N-glycans of these plants. The peak at 2235 found in WT 
describing the- (GF) (GF)XF structure was shifted in the N-glycan 
10 pattern of the transgenic plants confirming the activity of the 
human beta 1,4 galactosyltransf erase in Physcomitrella patens. 

6. Creating the knockout construct for alpha 1,,3-fucosyltrans- 
ferase and integration of human beta 1 , 4-galactosyl-transf erase 

15 -From genomic DNA of Physcomitrella patens a 2.66 kb fragment of 
the alpha 1 , 3-f ucosyltransf erase gene was amplified by using 
primer FTB-F (5 1 -TAGGATCCAGATGATGTCTGCTCGGCAGAATGG-3 1 , SEQ ID NO 
78 ) and primer FTB-R ( 5 1 -CTGGATCCTTGTAGATCCGAAGGTCTGAGTTCC-3 1 , 
SEQ ID NO 79), both primers introduced Bam HI restriction sites. 

20 The amplification product was cloned into pCR4-T0P0 (Invitrogen, 
USA). The cloned alpha 1, 3-f ucosyltransf erase gene fragment 
contained two Hind III restriction sites. In these Hind III 
restriction sites the cDNA of beta 1, 4-galactosyltransf erase 
under the control of CaMV 35S promoter and CaMV terminator was 

25 introduced by ligation, resulting in the plasmid pCR4-FTko- 
GalTki. Digestion of pCR4~FTko-GalTki with Bam HI resulted in a 
fragment which, contained the cDNA of beta 1, 4-galactosyltrans- 
ferase under the control of CaMV 35S promoter and CaMV 
terminator flanked 5' and 3 1 by sequences homolog to the beta 

30 1, 3-fucosyltransferase gene of Physcomitrella patens. This 
fragment was used for knock out experiments . 

6.1 PCR 

For PCR analyses genomic DNA isolated from the putative knockout 
35 plants was used. The primers MoB 435 (5 1 -TCCTACCTGCGGAGCAACAGA- 
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TATTG-3 1 , SEQ ID NO 80) and MoB 495 ( 5 1 -GTGGACCCAGATTTGCTGGTGC- 
ACTTG-3 1 , SEQ ID NO 81) were located on the sequences homologous 
to the alpha 1, 3-f ucosyltransf erase on both sides of the beta 
1, 4-galactosyltransf erase cassette, A fragment of 2 kbp corres-. 
ponding to' the expected alpha 1 , 3-f ucosyltransf erase sequence 
was amplified from WT.- From knockout plants a 2.8 kbp fragment 
corresponding to the introduced beta 1, 4-galactosyltransf erase 
cassette was amplified and no 2 kbp fragment could be detected, 
confirming the knockout of alpha 1, 3-fucosyltransferase as well 
as the integration of the beta 1, 4-galactosyltransf erase 
cassette.. 

6.2 MALDI-TOF mass spectrometry 

The N-glycans of Physcomitrella patens WT exhibit the typical 
structural features of plant N-glycans as described in e.g. 
Wilson et al. (2001), supra. I.e. fucose in alpha 1,3-linkage to 
the Asn-bound GlcNAc, xylose in beta 1,2-linkage to the beta 
. mahnosyl reidue, Lewis A epitopes (alpha 1,4-fucosyl and beta 
1, 3-galactosyl residues linked to GlcNAc) as non reducing 
terminal elements (tab. 1). Three alpha 1,3 FT knockout, beta 
1,4 GT integration plants were analysed. No alpha 1,3 fucosyl 
residues linked to the Asn-bound GlcNAc could be detected on the 
N-glycans of these plants. The peak at 2235 found in WT 
describing the (GF) (GF)XF structure was shifted in the N-glycan 
pattern of the transgenic plants confirming the activity of the 
human beta 1, 4 galactosyltransf erase in Physcomitrella patens. 

7. Generating plants with alpha 1,3 fucosvltransf erase and beta 
1,2 xylosyltransf erase knockouts and beta 1 , 4-qalactosyl- 
transferase integration 

For generating plants without alpha 1,3-fucosyl and beta 1,2- 
xylosyl residues and with beta 1,4 linked galactosyl residues 
protoplasts derived, of Physcomitrella patens protonema were 
transformed with the constructs described in chapter 5 and 6. 
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Transformation could be performed subsequently as well as co- 
transformation with both constructs. 



7.1 PCR 

5 For PCR analyses same procedures as described in 5.1 and 6.1 
were performed. 

7.2 MALDI-TOF mass spectrometry 

The N-glycans of Physcomitrella patens WT exhibit the typical 
10 structural features of plant N-glycans as described in e.g. 
Wilson et al. (2001), supra. I.e. fucose in alpha 1,3-linkage to 
the Asn-bound GlcNAc, xylose in beta 1,2-linkage to the beta 
mannosyl reidue, Lewis A epitopes (alpha 1,4-fucosyl and beta 
1, 3-galactosyl residues linked to GlcNAc) as non reducing 
15 terminal elements (tab. 1) . • Three transgenic plants were 
analysed. No alpha 1,3 fucosyl residues linked to the Asn-bound 
GlcNAc nor beta 1,2-xylosyl residues could be detected on the N- 
glycans of these plants. The peak at 2235 found in WT describing 
the (GF) (GF)XF structure was shifted to mass peak of 1665 in the 
20 N-glycan pattern of the transgenic, plants confirming the 
activity of the human beta 1,4 galactosyltransf erase in 
Physcomitrella patens as well as the loss of 1,3 linked fucosyl 
and 1,2 linked xylosyl residues. 

25 8.1 Purification and analysis of recombinant human VEGF i 2X 
expressed in transiently transformed Physcomitrella protoplasts 

Protoplasts derived from protonema of tansgenic Physcomitrella 
plants containing human beta 1,4 galactosyltransf erasen and no 
alpha 1,3 fucoslytransf erase nor beta 1,2 xylosyltransf erase 
30 (7.2) were transformed with pRTlOlTPVEGF C3. Expressed VEGF121 
was secreted into the medium. After two, three and four days the 
culture medium was collected and replaced by fresh medium. 
Samples were filtered through a 0.22 microm Miiiex GP filter 
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' unit (Millipore) . Recombinant VEGF121 was purified by FPLC 
(Aktaexplorer 100, Amersham Biosciences,. Germany) using a SP- 
Sepharose column (Aruersham Biosciences, Germany) . Elution was 
performed with sodium chloride in 25 mM Na-acetate, pH 5:0. 
5 Eluted fractions were concentrated by centrifugation using 
Amicon Ultra filter devices, cut off 10.000 (Millipore). 

SDS/PAGE loading buffer was added to the samples, proteins were 
fractionated on SDS/PAGE containing 15 % acrylamide and 0.4 % 

10 bisacrylamide . Gels were stained with Coomassie Brilliant Blue 
R-250. VEGF121 band was excised from Coomassie stained SDS/PAGE 
and digested with sequencing grade trypsin. The glycans were 
released by treatment with peptide :N-glycosidase A and analysed 
by MALDI-TOF mass spectrometry on a DYNAMO (Thermo BioAnalysis, 

15 Santa. Fe, NM) . 

Detected N-glycan pattern of secreted recombinant VEGF-121 was 
. similar to that observed in transgenic GalT (with knockouts of. 
FucT and XylT, see 7.2) plants confirming the activity of the 
20 human beta 1,4 galactosyltransf erase in Physcomitrella patens as 
well as the loss of 1,3 linked fucosyl and 1,2 linked xylosyl 
residues . 

The following Tab. 1 shows the N-glycan structures of 
•25 Physcomitrella patens wildtype (wt) and knock out plants. N- 
glycans were isolated from plant material grown under same 
conditions (500 ml flasks, modified Knop medium) . F = fucosyl 
residue, G = glactosyl residue, Gn = N-acetylglucosaminyl 
residue, M / Man = mannosyl residue, X = xylosyl residue. 
30 (GF) (GF)XF depicts the most complex type N-glycan, the so called 
Lewis a formation. 
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CLAIMS 

1. A transformed bryophyte cell that comprises i) a 
dysfunctional fucosyl transferase nucleotide sequence . and ii) a 

5 dysfunctional xylosyl transferase nucleotide, sequence. 

2. A transformed bryophyte cell according to claim 1 further 
comprising a nucleotide sequence operably linked to an exogenous 

.promoter that drives expression in the said bryophyte cell 
10 wherein said nucleotide sequence encodes a functional mammalian 
galactosyl transferase that is expressed in the bryophyte cell. 

3. A. transformed bryophyte cell according to claim 1 or claim 2 
wherein the mammalian galactosyl transferase that is expressed 

15 is a beta-1,4 galT. 

4. A transformed bryophyte cell according to any one of claims 1 
to 3 wherein the mammalian galactosyl transferase that is. 
expressed is a human beta-1,4 galT. 

20 

5. A transformed bryophyte cell according to any one of claims .1 
to 4 wherein the said cell further comprises a nucleotide 
sequence operably linked to an exogenous promoter that drives 
expression in the said bryophyte cell wherein said nucleotide 

25 sequence encodes a mammalian glycosylated polypeptide that is 
expressed in the bryophyte cell. 

6. A bryophyte cell according to claim 5 wherein the bryophyte 
cell is selected from species of the genera Physcomitrella, 

30 Funaria, Sphagnum, Ceratodon, Marchantia and Sphaerocarpos . 

7. A bryophyte cell according to claim 6 wherein the bryophyte 
cell is selected from Physcomitrella. 



35 
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8. A bryophyte cell according to claim 7 wherein the bryophyte 
cell is from Physcomitrella patens. 

9. A bryophyte cell according to claim 8 wherein the mammalian 
5 .glycosylated polypeptide is selected from the group comprising a 

polypeptide having a primary amino acid sequence of a human 
glycosylated polypeptide, a primary amino acid sequence of a- 
non-human mammalian glycosylated protein, a primary amino acid 
. sequence of an antibody or an active fragment thereof, and/or a 
10 primary amino acid sequence of a non-mammalian glycosylated 
polypeptide - 

10. A bryophyte cell according to any one of claims 1 to 9 
wherein the mammalian glycosylated polypeptide is a human 

15 polypeptide. 

11. A bryophyte cell according to claim 9 or claim 10 wherein 
•the mammalian glycosylated polypeptide is selected from the 

group consisting of human insulin, preproinsulin, VEGF, 
20 proinsulin, glucagon, interferons such as alpha-interf eron, 
beta-interf eron, gamma-interf eron, blood-clotting factors 
selected from Factor VII, VIJI, IX, X, XI, and XII, fertility 
• hormones including" luteinising hormone, follicle stimulating 
hormone growth factors including epidermal growth factor, 
25. platelet-derived growth factor, granulocyte colony stimulating, 
prolactin, oxytocin, thyroid stimulating hormone, adrenocortico- 
tropic hormone, calcitonin, parathyroid hormone, somatostatin, 
erythropoietin (EPO) , and enzymes such as beta-glucocerebro- 
sidase, haemoglobin, serum albumin, and collagen. 

30 

12. A method of producing at least a bryophyte cell wherein fucT 
and xylT activity is substantially reduced that comprises 
introducing into the said cell i). a first nucleic acid sequence 
that is specifically targeted to an endogenous fucosyl 

35 transferase nucleotide sequence and ii) introducing into the 
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said cell a second nucleic acid sequence that is specifically 
targeted to an endogenous xylosyl transferase nucleotide 
sequence. 

5 13. A method according to claim 12 further ^comprising 
introducing into the said cell an isolated nucleic acid sequence 
that comprises nucleic acid operably linked to an exogenous 
promoter that drives expression in a bryophyte cell wherein said 
nucleic acid encodes a functional mammalian galactosyl 
10 transferase polypeptide . 

14. A method according to claim 12 or claim 13 wherein the 
galactosyl transferase nucleotide sequence is a beta-1,4 
galactosyl transferase (beta-1,4 galT) nucleotide sequence.. 

15 

15. A method according to claim 14 wherein the galactosyl 
transferase nucleotide sequence is a human beta-1,4 galactosyl 
transferase (beta-1,4 galT) nucleotide sequence. 

20 16. A method according to any one of claims 12 to 15 wherein the 
said transformed bryophyte cell further comprises a nucleotide 
sequence operably linked to an exogenous promoter that drives 
expression in the said bryophyte cell wherein said nucleotide 
sequence encodes a mammalian glycosylated polypeptide that is 

25 expressed in the bryophyte cell. 

17. A method according to any one of claims 12 to 16 wherein the 
mammalian glycosylated polypeptide is selected from the group 
comprising a protein having a primary amino acid sequence of a 
30 human protein, a primary amino acid sequence of a non-human 
mammalian protein, a primary amino acid sequence of an antibody 
or an active fragment thereof, and/or a primary amino acid 
sequence of a non-mammalian protein. 

35 18. A method according to any one of claims 12 to 17 wherein the 
glycosylated polypeptide is selected . from the group consisting 
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of human insulin, preproinsulin, proinsulin, glucagon, 
interferons such as alpha-interf eron, beta-interf eron, gamma- 
interferon, - blood-clotting factors selected from Factor VII, 
VIII, IX, X, XI, and XII, fertility hormones including 
5 luteinising hormone, follicle stimulating hormone growth factors 
including epidermal growth factor, platelet-derived growth 
factor, granulocyte- colony stimulating, prolactin, oxytocin, 
thyroid stimulating hormone, adrenocorticotropic hormone, 
calcitonin, parathyroid hormone, somatostatin, erythropoietin 

10 (EPO) , and enzymes such as beta-glucocerebrosidase, haemoglobin, 
serum albumin, collagen, and human and non-human proteins 
selected from amidases, amylases, carbohydrases, cellulase, 
dextranase, esterases, glucanases, glucoamylase, lactase, 
lipases, pepsin, • peptidases, phytases, proteases, pectinases, 

15 casein, whey proteins, soya proteins, gluten and. egg albumin. 

19. A method according to any one of claims 12 to 18 wherein the 
bryophyte cell is selected from species of the genera 
Physcomitrella, . Funaria, Sphagnum, Ceratodon, Marchantia and 

20 Sphaerocarpo's . 

20. A method according to claim 19 wherein the bryophyte cell 
is selected from Physcomitrella. 

25 21. A method according to claim 20 wherein the bryophyte cell is. 
from Physcomitrella patens. 

22. A method according to any one of claims 12 to 21 wherein the 
mammalian glycosylated polypeptide is selected from the group 

30 comprising a polypeptide having a primary amino acid sequence of 
a human glycosylated polypeptide, a primary amino acid sequence 
of a non-human mammalian glycosylated protein, a primary amino 
acid sequence of an antibody or an active fragment thereof, 
and/or a primary amino acid sequence of a non-mammalian 

35 glycosylated polypeptide. 
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23. A method according to any one of claims 19 to 22 wherein the 
mammalian glycosylated polypeptide is a human polypeptide. 

24. A method according to any one of claims 19 to 23 wherein the 
5 mammalian glycosylated polypeptide is selected from the group 

consisting of human insulin, preproinsulin, VEGF, proinsulin, 
glucagon, interferons such as alpha-interf eron, beta-interf eron, 
gamma-interf eron, blood-clotting factors selected from Factor 
VII, VIII, IX, X, XI, and XII, fertility hormones including 

10' luteinising hormone, follicle stimulating hormone, growth factors 
including epidermal growth factor, platelet-derived growth 
factor, granulocyte colony stimulating, prolactin, oxytocin, 
thyroid stimulating hormone, adrenocorticotropic hormone> 
calcitonin, parathyroid hormone, somatostatin, erythropoietin 

15 (EPO) , and enzymes such as beta-glucocerebrosidase, haemoglobin, 
serum albumin, and collagen. 

.25. A method according to any one of claims 12 to 24 wherein the 
exogenous promoter is selected from inducible, chemical- 
20 regulated, constitutive or cell specific promoters. 

26. An isolated polynucleotide that encodes a functional 
mammalian glycosyl transferase for use in a method according to 
any one of claims 12 to 25. 

25 

27. An isolated polynucleotide according to claim 26 that 
encodes a recombinant mammalian galactosyl transferase for use 

•in a method according to any one of claims 12 to 26. 

30 28. An isolated polynucleotide according to claim 26 that 
encodes a recombinant human beta-1,4 galactosyl transferase for 
use in a method according to any one of claims 12 to 27. 

29. A nucleic acid vector suitable for transformation of a 
35 foryophyte cell and including a polynucleotide according to any 
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.one of claims 26 to 28. 

30. A host cell containing a heterologous polynucleotide or 
nucleic acid vector according to any one of claims 26 to 29. 

5 

31. A host cell ' according to claim 30 which is a bryophyte cell. 

32. A host cell according to claim 29 which is a prokaryote 
cell, 

10 

33. A method of producing a host cell according to any of claims 
30 to 32, the method including incorporating said polynucleotide 
or nucleic acid vector into the cell by means of transformation. 

15 34. Use of a polynucleotide according to any one of claims 2 6 to 
28 in the production of a transgenic bryophyte cell. 

35. A host cell according to claim 30 or 31 which is comprised, 
in a bryophyte, or a bryophyte part, or an extract or derivative 

20 of a bryophyte or in a bryophyte cell culture. 

36. A bryophyte plant or bryophyte tissue comprising a- bryophyte 
cell according to any one of .claims 30, 31, or 35. 

25 37. A method of producing a bryophyte plant, the method 
including incorporating a polynucleotide or nucleic acid vector 
according to any of claims 26 to 29 into a bryophyte cell arid 
regenerating a bryophyte from said cell. 
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SEQUENCE LISTING 

<110> greenovation Biotech GmbH 

<120> improvements in or . relating to protein production 
<130> Protein Production Method 
<140> 1041 WO 
<160> 89 

<170> Patentin ver. 2.1 

<210> 1 
<211> 36 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence MoB323 
<400> 1 

atactcgagg aagatgaact tttctgcctg tcttgg 
36 

<210> 2 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence MOB349 
<400> 2 

ctgccatggg tgcagcctgg gaccac 
26 

<210> 3 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNT(d)l 
<400> 3 

gtngcngcng tngtngtnat ggc 
23 

<210> 4 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GTN(d)3 
<400> 4 

ccyttrtang cngcnctgng gnacncc 
27 

<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GTN (d) 2 
<400> 5 

tayaaratnc agncaytaya artgg 
25 

<210> 6 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GTN(d)4 
<400> 6 

arrtaytgyt traaraaytg ncc 
23 

<210> 7 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5RACEG3 
<400> 7 

gtccgtgtcc aataaaggag 
20 

<210> 8 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5RACEG4 
<400> 8 

gtcgggagag atttccatgt c 

<210> 9 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5RACEG5 
<400> 9 

ctaagatgac gacccttcgg 
20 

<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5RACE6 
<400> 10 

catcctgaga aacaaaaagt gg 
22 

<210> 11 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5RACE7 
<400> 11 

agttacagac ttcaatgtac g 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence - 
<223> Description of Artificial Sequence: Primer sequence 5RACE8 
<400> 12 

aatcaggacg gttgcaagcc 
20 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 3RACEG1 
<400> 13 

ttatccgacc tgaagtttgc 
20 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial sequence: Primer sequence 3 R AC EG 2 
<400> 14 

gacctacaat tttggagagc 
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20 

<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNT5F 
<400> 15 

tgggctttaa cacaactttt 
20 

<210> 16 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GTN6R 
<400> 16 

gccctaagct tgatccctg 
19 

<210> 17 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence: Primer. sequence GNT21F 
<400> 17 

atggcagata tggctcgatt g 
21 

<210> 18 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNT15R 
<400> 18 

agtttctatg gtatctaact gc 

<210> 19 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence: Primer sequence GNTHT7 
<400> 19 

cjagcatccaa gcttgacctg g 

<210> 20 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence: Primer sequence GNTET7 
<400> 20 

gcaccgtgaa ttcttctagc tt 

<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNTHT3 
<400> 21 

ggaagaacaa gcttcaaagt ggc 
23 

<210> 22 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Primer sequence GNTPT3 
<400> 22 

gatccctgca gatctcaaac g 



<210> 23 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNT7F 
<400> 23 

gttcsatggt ttgagcagg 
19 

<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial sequence: Primer sequence GNT8R 
<400> 24 

gcgacctttc ctattctcc 
19 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence Nl 
<400> 25 

taccgacagt ggtcccaaag 
20 

<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence N2 
<400> 26 . 

ccaccatgat attcggcaag 
20 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNT5F 
<400> 27 

tcjggctttaa cacaactttt 

<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence N3 
<400> 28 

tgtcgtgctc caccatgtt 

<210> 29 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence: Primer sequence N4 
<400> 29 

gtt^cata taagaaac 



<210> 30 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence GNTlOR 
<400> 30 

cacattgttc aatttgatag ac 
22 

<210> 31 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence fd4f 
<400> 31 

tgggcngart aygayatgat g 

<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence FDRl 
<400> 32 

tgngtnarnc cnadnggrta dat 
23 

<210> 33 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence FD5R 
<400> 33 

tgnacngcng ccatrtc 
17 

<210> 34 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5FT4 
<400> 34 

gtaacattcg cataatgg 

<210> 35 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5ft5 
<400> 35 

cgatcattat gcgcaccac 
19 

<210> 36 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5FT6 
<400> 36 
gaaataaaa gcagctcc 



?! 



8 

<210> 37 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5FT7 
<400> 37 

agggtgaatc tccatagcc 
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19 

<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial sequence: Primer sequence 5FT8 
<400> 38 

catctgcctg accctcacc 
19 

<210> 39 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5FT9 
<400> 39 

cjccttgaaca cgcatggc 

<210> 40 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5FT10 
<400> 40 

cgatacaacc agcacagg 
18 

<210> 41 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence:. Primer sequence 5FT11 
<400> 41 

cttctctagc cattctgcc 
19 

<210> 42 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 3FT1 
<400> 42 

gcagtggaag tttaatggtc 
20 

<210> 43 
<211> 21 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Primer sequence 3FT2 
<400> 43 

tcgtttctag ctctagtaga c 
21 

<210> 44 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence FTA9F 
<400> 44 

atgctcccag cccaagac 
18 

<210> 45 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Primer sequence FTAlOR 
<400> 45 

tcjtctactag agctagaaac g 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial sequence: Primer sequence ft18f 
<400> 46 

tagggagtaa atatgaaggg 
20 

<210> 47 
<211> 21 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Primer sequence FTA12R 
<400> 47 

tacttccaat tgaagacaag g 

<210> 48 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence ft15f 
<400> 48 

aatgttctgt gccatgcg 
18 

<210> 49 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence FJ16R 
<400> 49 

tgcttcaaat gggctaggg 

<210> 50 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence:. Primer sequence nptll/Ndel-F 
<400> 50 

atgccatatg gcatgcctgc aggtcaac 
28 

<210> 51 
<211> 29 
<212> DNA 

<213> Artificial sequence 

<223> Description or Artificial sequence: Primer sequence nptli/Bstzl7l-R 
<400> 51 

gcatgtatac gcatgcctgc aggtcactg 
29 

<210> 52 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence FT14F 
<400> 52 

acaaagttac atactcgcg 
<210> 53 
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<211> 20 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Primer sequence RlO . 
<400> 53 

tctttcacta ttcggtgacg 
20 

<210> 54 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence Rll 
<400> 54 

cgaccacaac attagatcc 
19 

<210> 55 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence XDFl 
<400> 55 

ttjygargsnt ayttyggnaa ygg 

<210> 56 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence XDRl 
<400> 56 

gcncknayca tytcnccraa ytc 
23 

<210> 57 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial sequence: Primer sequence XDF2 
<400> 57 

ggnggngara arytngarra ngt 
23 

<210> 58 

<211> 18 - 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Primer sequence 5XTl 
<400> 58 

tcctccttct ctgggacc 
18 

<210> 59 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5xt2 
<400> 59 

agctccagtt gtgaaatatg g 

<210> 60 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Primer sequence 5XT4 
<400> 60 

cttccttcac cacactac 
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<210> 61 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<223> Description of Artificial 
<400> 61 

tagcatgact gtgtggcc 
18 

<210> 62 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<223> Description of Artificial 
<400> 62 

aaaggcttga gtgtagcc 
18 

<210> 63 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<223> Description of Artificial 
<400> 63 

gcctttcttg cacgggttg 
19 

<210> 64 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<223> Description of Artificial 
<400> 64 

ggacattcca aataatccc 
19 

<210> 65 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<223> Description of Artificial 
<400> 65 

gtcctgttaa atgccttgc 

<210> 66 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<223> Description or Artificial 
<400> 66 

aggttgagca atcatatggc 
20 

<210> 67 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<223> Description of Artificial 
<400> 67 

atcccagaaa tatctgatcc 
20 
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Sequence: Primer sequence 5xt5 



Sequence: Primer sequence 5XT6 



Sequence: Primer sequence 3XT1 



Sequence: Primer sequence 3xt2 



Sequence: Primer sequence XT15R 



Sequence: Primer sequence XT-MlF 



Sequence: Primer sequence XTllR 



<210> 68 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 
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<223> Description of Artificial Sequence: Primer sequence XT12F 
<400> 68 

tgtgaggcgt tctttggc 

<210> 69 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence nptil/Sall-F 
<400> 69 

atgcgtcgac gtcaacatgg tggagcacg 
29 

<210> 70 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence nptil/Ndel-R 
<400> 70 

gcatcatatg tcactggatt ttggttttag g 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence XT13R 
<400> 71, 

acgatccaaa atctggacgc 
20 

<210> 72 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial sequence: Primer sequence Galrxh-F 
<400> 72 

ttctcgagac aatgaggctt cgggagccgc tc 

<210> 73 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: primer sequence GalTXb-R 
<400> 73 

ggtctagact agctcggtgt cccgatgtcc 
30 

<210> 74 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial sequence: Primer sequence xtb-f 
<400> 74 

ttggatcctc aattacgaag cacaccatgc 
30 

<210> 75 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence: Primer sequence XTB-R 
<400> 75 

ttggatcctc ctcccagaaa catctgatcc ag 
<210> 76 
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<211> 28 
<212> DNA 

<213> Artificial sequence 

<223> Description or Artificial sequence: Primer sequence MOB521 
<400> 76 

ttgccgctat ctacttgtat gctaacgt 
28 

<210> 77 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence MOB575 
<400> 77 

tgccgtggat gtgctagata atctt 

<210> 78 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence FTB-F 
<400> 78 

taggatccag atgatgtctg ctcggcagaa tgg 

<210> 79 
<211> 33 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Primer sequence FTB-R 
<400> 79 

ctggatcctt gtagatccga aggtctgagt tec 

<210> 80 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence mob435 
<400> 80 

tcctacctgc ggagcaacag atattg 
26 

<210> 81 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence MoB495 . 
<400> 81 

gtggacccag atttgctggt gcacttg 
27 

<210> 82 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence XT14F 
<400> 82 

ttacgaagca caccatgc 

<210> 83 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence 5XT3 
<400> 83 

ttcttcctca tttcgtccc 



19 
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<210> 84 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

< ?22 > Description or Artificial Sequence: Primer sequence MoB558 
<400> 84 

gttccgcggt gatcccgttt tcatatcagt gtatt 

<210> 85 
<211> 36 
<212> DNA 

<213> Artificial Sequence 

Description of Artificial Sequence: Primer sequence mob557 
<400> 85 

tttgagctct acgtaacaat aacataaaat atcaca 
36 

<210> 86 
<211> 815 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: PCR product 
<400> 86 

gttccgcggt gatcccgttt tcatatcagt gtattatcat cagtgactgc atattgacac 

ccaattctga tgatttttta ttttttattt tttatttttt ttggtatggt tacatgcttt 

tcagaggttt ctatgccgct gagtattttc ctgaatcgcg aggtgtgaca ggttatctgc 
180 

gccgtccacc caatatttta tgatgagtcg atgattcgtg agactaatct agcttaacct 

ttttcttact ggcaagtcaa aattgagttt aaaatatttc agtatcctgt tagtaatttc 

agacacatgt attctatgtc tcatactctt tacgtgaaag ttcaactgac ttatattttg 

tcgtttttct gtagatcact gttttagcgc atacaaagac aattgtctaa atatttttaa 
420 

agaaggtgat attttattat aagatagaag tcaatatgtt tttttgttat gcacatgact 

tgaataaaat aaattttttt gttagattta aatacttttt gaattatagc tttgttgaaa 
540 

ttaaggaatt tatattcata agaagctact cgaacaaatt tacaaagaga acatttgata 
600 

agtaaaagta attaaaagtt ttttttaatt taaaaagatt aatttttatt aataagaaga 

acttggaaag ttagaaaaat atttaacttt aaaaattaag aaaacaaggc aaaactttaa 

tttacaaata cttaatgtag attaattttc ttattatata ttagcacaaa ttatcattat 
780 

gtgatatttt atgttattgt tacgtagagc tcaaa 

<210> 87 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer sequence MOB555 
<400> 87 

cgcgttaact ctctctatct ctctctgtgt tgcg 
34 

<210> 88 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<223> Description or Artificial Sequence: Primer sequence MoB556 
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<400> 88 

cgagaattct cacttagaag aagcccaatc ct 

<210> 89 
<211> 882 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: PCR product 
<400> 89 

cgcgttaact ctctctatct ctctctgtgt tgcgtttgat caggggtttt agggtttggg 
60 

tccagggttc cgaggagtat cgtcacgtgt attgcggtct tgttggagat tcctcagttg 

tgcatgtaga tataaactta gtttagtcca cgatcggttt ctaatcgtgg atttttgtgg 
180 

gtttcggtcg ttgagcaaga attttgtgaa ttttttgtat tgggggaagg aaatggggtt 
240 

atggcgatat cgttttcgtt gggttcaacg tgatcggtga gctccaggaa gggctggtca 

ctcacaatcc ggtattcgtc tcatcgagac gcatttatcg gttcattata tgtatatata 

tatatatata tatatgcaga gtcgattgtg ttgcaatttc tgaactaggt actgttgaat 
420 

tgtagattgc cttcaagtag ctctcgatgt tggaatgacg sacacaaatt ctgctactga 

atgagaccat attctgcacc gttaattggt tttatgaata tatggtgtcg aattacattc 

tgtctcgaat ccatgcgccc tttctgcacg aacgttggtt tgtagttgta gtgcagccag 
600 

tgtgtttggt ttaggattat gctttgacga tcgatgagtc cgtttcatgg ttttatactt 
660 

gtcatttatc ttcttgtgat tttttgttta caaatgttcc cccaattgta acgtgggact 

ttxgtgtgtg gtggttgctc aaattgatag ttttggtcat ttgatttgcg gagagcaatc 
780 

ggtgtcatgg aaaatccctt cgactgcttt gatccaatca aagttctgct tgagccaatg 
840 

tgagaggtgg aggattgggc ttcttctaag tgagaattct eg 
882 



